Conserved domains on  [gi|1958747106|ref|XP_038953843|]

ankyrin repeat domain-containing protein 11 isoform X3 [Rattus norvegicus]

Protein Classification

ankyrin repeat domain-containing protein (domain architecture ID 13791050)

ankyrin (ANK) repeat domain-containing protein may be involved in mediating protein-protein interactions

List of domain hits

Name Accession Description Interval E-value
ANKYR  COG0666  Ankyrin repeat [Signal transduction mechanisms]  41-127  9.11e-24

Pssm-ID: 440430 [Multi-domain]  Cd Length: 289  Bit Score: 103.88  E-value: 9.11e-24
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106   41 GWTALHEACNRGYYDIAKQLLAAGAEVNTKGLDDDTPLHDAANNGHYKVVKLLLRYGGNPQQSNRKGETPLKVA---NSP 117
Cdd:COG0666    120 GETPLHLAAYNGNLEIVKLLLEAGADVNAQDNDGNTPLHLAAANGNLEIVKLLLEAGADVNARDNDGETPLHLAaenGHL 199
                           90
                   ....*....|
gi 1958747106  118 TMVNLLLGKG 127
Cdd:COG0666    200 EIVKLLLEAG 209
PTZ00121 super family  cl31754  MAEBL; Provisional  572-1474  1.90e-16

The actual alignment was detected with superfamily member PTZ00121:

Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 86.73  E-value: 1.90e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106  572 ERDRPSKTERERSTKEKSPKEEKlrlYKEERKKKSKDRPFKLEKkndMKEVSKEKEKAFREDKEKLKKEKLCRDDAAFDD 651
Cdd:PTZ00121  1070 EGLKPSYKDFDFDAKEDNRADEA---TEEAFGKAEEAKKTETGK---AEEARKAEEAKKKAEDARKAEEARKAEDARKAE 1143
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106  652 YCNKSQfldhEDTKFSLSDDQQERWFSDLSDSSFDFKGEDSWDSVTDYRdiKSDSVAKLILETVKEDSKEKKRDNKTREK 731
Cdd:PTZ00121  1144 EARKAE----DAKRVEIARKAEDARKAEEARKAEDAKKAEAARKAEEVR--KAEELRKAEDARKAEAARKAEEERKAEEA 1217
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106  732 RDFRDSffrkRDRDCVDRNSEKRRDHTEKQRSFPSYLSEKDKKRRESAEGGRDRRDTLEGSRERR--DGRIRSEEVHR-E 808
Cdd:PTZ00121  1218 RKAEDA----KKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARkaDELKKAEEKKKaD 1293
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106  809 DLKEcgcdSTFKDKSDcdftktlepwERPHAAREKEKKDALEKDRKE-KGRAEKYKDKSGERERNEKSILEKCQKDKEFE 887
Cdd:PTZ00121  1294 EAKK----AEEKKKAD----------EAKKKAEEAKKADEAKKKAEEaKKKADAAKKKAEEAKKAAEAAKAEAEAAADEA 1359
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106  888 KCFKEKKDGKEKHKDTHSKDRKTPFDQLREKKEKAFSSLISEDFSERKDDRKGKEKSWYIADiftdESEDEKEECVASSF 967
Cdd:PTZ00121  1360 EAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKAD----EAKKKAEEKKKADE 1435
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106  968 KTGETGDSQRAESLQEKEDGREHPSDRHRKASSDRQHTEKPRDKEPKEKRKD-RGAAEGGKDKKEKIFEKHKEKKDKECA 1046
Cdd:PTZ00121  1436 AKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEaKKKAEEAKKKADEAKKAAEAKKKADEA 1515
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1047 EKYKERKDRASVDSAPEKKNKQKLPEKVEKKHFVE-------DKAKSKHK-EKPEKDHSRERKSSRGPDVEKSLLEKLEE 1118
Cdd:PTZ00121  1516 KKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADElkkaeelKKAEEKKKaEEAKKAEEDKNMALRKAEEAKKAEEARIE 1595
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1119 EALHDYREDSNDKISEVSSDSFADHGQEpSLSTLLEVSFSEPPAEDKARESTCLSEKLKERERERHRHSSSSSKKSHERE 1198
Cdd:PTZ00121  1596 EVMKLYEEEKKMKAEEAKKAEEAKIKAE-ELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDK 1674
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1199 RAKKekadkkekgEDYKDGGGGRKDASQYEKDfadAEAFGGSYTTKADTEEDLDKAIELFSSEKKDRNDSER-----EPA 1273
Cdd:PTZ00121  1675 KKAE---------EAKKAEEDEKKAAEALKKE---AEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEakkeaEED 1742
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1274 KKLEKELKPYGSSTISILKEKKKREKHREKWREEKEKHRDKHIDGFLRHHKDEPKPAAKD-KDNPPNcFKEKSREESLKL 1352
Cdd:PTZ00121  1743 KKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVDKKIKDiFDNFAN-IIEGGKEGNLVI 1821
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1353 SEAKLKE--KFKE-----NAEREKGDSVKMSNGNDKPLPSRDANKKDSRPREKLLGDGDLmmtsfERMLSQKdlEIEERH 1425
Cdd:PTZ00121  1822 NDSKEMEdsAIKEvadskNMQLEEADAFEKHKFNKNNENGEDGNKEADFNKEKDLKEDDE-----EEIEEAD--EIEKID 1894
                          890       900       910       920
                   ....*....|....*....|....*....|....*....|....*....
gi 1958747106 1426 KRHKERMKQMEKMRHRSGDpKLKEKKPTDDGRKKSLDFPSKKALGLDKK 1474
Cdd:PTZ00121  1895 KDDIEREIPNNNMAGKNND-IIDDKLDKDEYIKRDAEETREEIIKISKK 1942
PHA03247 super family  cl33720  large tegument protein UL36; Provisional  1698-2209  3.33e-12

The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 72.66  E-value: 3.33e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1698 PPPDSvfSNLPPKSSPSPrgELLTPAIEG-ALPPDL-----GLPLDATEDQQATAAILPPEPSylepldegpfntvitee 1771
Cdd:PHA03247  2502 GPPDP--DAPPAPSRLAP--AILPDEPVGePVHPRMltwirGLEELASDDAGDPPPPLPPAAP----------------- 2560
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1772 pvewtHSAAEQSLPSSLIASASETPVswpVGSELmlKSPQRFAESPKHFCPGEPlhsttPGPFSAAEPTYPVSPGSYPL- 1850
Cdd:PHA03247  2561 -----PAAPDRSVPPPRPAPRPSEPA---VTSRA--RRPDAPPQSARPRAPVDD-----RGDPRGPAPPSPLPPDTHAPd 2625
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1851 ---PAPEPALEEVKDGGTGAIPVAIAAAEGAAPYTAPTRLESFFSNCKPHPDAPLDTAPEPASVTTVAQVEALG---PLE 1924
Cdd:PHA03247  2626 pppPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLAdppPPP 2705
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1925 SSFLDSSHSISALSQVEPVSWHEAFTSPEDDLDLGPFSLPELPlqAKDASDVEAETAEASPAPPVESPP-GPTGVlsggd 2003
Cdd:PHA03247  2706 PTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGP--ATPGGPARPARPPTTAGPPAPAPPaAPAAG----- 2778
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 2004 vPASTTEEPPAPPPQEASPQLSTEPEPSEETKldVVLEAAAETEVLADDSAPEASISNLVPAPSPPEQQRP--------- 2074
Cdd:PHA03247  2779 -PPRRLTRPAVASLSESRESLPSPWDPADPPA--AVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPppslplggs 2855
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 2075 -AGGGDEETEAEDPSAAPC-CAPDGPTTDGLAQAPNSAEAACVVAAVEGPPGNIQPEA-----------TDPEPKPTSEA 2141
Cdd:PHA03247  2856 vAPGGDVRRRPPSRSPAAKpAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAppppqpqpqppPPPQPQPPPPP 2935
                          490       500       510       520       530       540
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1958747106 2142 PKAPKVEEVPQRMTRNRAQMLASQSKQGIPATEKDSMPAPASRAKGRAPEEEDAQAQHPRKRRFQRSG 2209
Cdd:PHA03247  2936 PPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSR 3003
PHA03247 super family  cl33720  large tegument protein UL36; Provisional  1442-1787  7.94e-06

The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 51.86  E-value: 7.94e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1442 SGDPKLKEKKPTDDGRKKSLDFPSKKALGLDKKVKEPAPVLPTGEGKPhSGPGTESKDWLAGQPLKEVLPASPRTEQGRP 1521
Cdd:PHA03247  2698 LADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVP-AGPATPGGPARPARPPTTAGPPAPAPPAAPA 2776
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1522 TGVP---TPTSVVSCPSYEEVMHTPRTPSCSADDYPDLVFDCTDSQHSMPVSTTSTSAC-------SPPFFDRFSVASSV 1591
Cdd:PHA03247  2777 AGPPrrlTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQptappppPGPPPPSLPLGGSV 2856
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1592 V--AENAGQTPTRPISTNLYRSISVDIRRTPEEEFSVGDKLFRQ---------QSVPAASSFDSPVQHLLEEKAPLPPVP 1660
Cdd:PHA03247  2857 ApgGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALppdqperppQPQAPPPPQPQPQPPPPPQPQPPPPPP 2936
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1661 AEKFACLSPEYYSPDYGIPSPKVDTLH----CPPTAVVSATPPPDSVFSNLPPKSSPSPRGELLTPAI----------EG 1726
Cdd:PHA03247  2937 PRPQPPLAPTTDPAGAGEPSGAVPQPWlgalVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSRVsswasslalhEE 3016
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1958747106 1727 ALPPDLGL-----PLDATEDQQATAAILP-PEPSYLEPLD--EGPFNTVITEEPVEWTHSAAEQSLPSS 1787
Cdd:PHA03247  3017 TDPPPVSLkqtlwPPDDTEDSDADSLFDSdSERSDLEALDplPPEPHDPFAHEPDPATPEAGARESPSS 3085
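The summary line printed above each alignment (Pssm-ID, Cd Length, Bit Score, E-value) has a regular layout, so it can be pulled into structured fields. The following is a minimal Python sketch, not an NCBI tool: the names SUMMARY_RE, parse_summary and hit_coverage are hypothetical, and the pattern assumes the exact field labels shown on this page.

```python
import re

# Hypothetical pattern; it assumes the field labels exactly as printed on this page.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*"
    r"(?:\[(?P<scope>[^\]]+)\])?\s*"
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s*"
    r"E-value:\s*(?P<evalue>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return the fields of one hit summary line as a dict, or None if no match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "scope": m.group("scope"),  # e.g. "Multi-domain"; None when absent
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "evalue": float(m.group("evalue")),
    }


def hit_coverage(cd_from, cd_to, cd_length):
    """Fraction of the domain model spanned by the alignment (inclusive coordinates)."""
    return (cd_to - cd_from + 1) / cd_length
```

For the COG0666 hit, the alignment runs from model position 120 to 209 of a model of length 289, so hit_coverage(120, 209, 289) evaluates to 90/289, or roughly 0.31.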
 
Name Accession Description Interval E-value
Ank_2  pfam12796  Ankyrin repeats (3 copies)  45-127  3.22e-16

Pssm-ID: 463710 [Multi-domain]  Cd Length: 91  Bit Score: 75.92  E-value: 3.22e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106   45 LHEACNRGYYDIAKQLLAAGAEVNTKGLDDDTPLHDAANNGHYKVVKLLLRYggNPQQSNRKGETPLKVA---NSPTMVN 121
Cdd:pfam12796    1 LHLAAKNGNLELVKLLLENGADANLQDKNGRTALHLAAKNGHLEIVKLLLEH--ADVNLKDNGRTALHYAarsGHLEIVK 78

                   ....*.
gi 1958747106  122 LLLGKG 127
Cdd:pfam12796   79 LLLEKG 84
PHA02878  PHA02878  ankyrin repeat protein; Provisional  43-128  1.41e-08

Pssm-ID: 222939 [Multi-domain]  Cd Length: 477  Bit Score: 59.89  E-value: 1.41e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106   43 TALHEACNRGYYDIAKQLLAAGAEVNTKGLDDDTPLHDAANNGHYKVVKLLLRYGGNPQQSNRKGETPLKVANSP----T 118
Cdd:PHA02878   170 TALHYATENKDQRLTELLLSYGANVNIPDKTNNSPLHHAVKHYNKPIVHILLENGASTDARDKCGNTPLHISVGYckdyD 249
                           90
                   ....*....|
gi 1958747106  119 MVNLLLGKGT 128
Cdd:PHA02878   250 ILKLLLEHGV 259
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
1653-2029 2.23e-06

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly by altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 53.23  E-value: 2.23e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1653 KAPLPPVPAekfaCLSPEYYSPDYGIPSPKVDTLHCPPTAVVSATPPPDSVFSNLPPKSSPSPRGELLTPAIEGALPPDL 1732
Cdd:pfam03154  175 QAQSGAASP----PSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPHPPLQ 250
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1733 GLPLDATEDQQATAAIlpPEPSYLEPLDEGPFNtvITEEPVEWTHSAAEQSLPSSLIASASETPVSwpvgselmlksPQR 1812
Cdd:pfam03154  251 PMTQPPPPSQVSPQPL--PQPSLHGQMPPMPHS--LQTGPSHMQHPVPPQPFPLTPQSSQSQVPPG-----------PSP 315
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1813 FAESPKHFCPgeplhsTTPGPFSAAEPTYPvsPGSYPLPAPEPALEEVKDGGTGAIPVAIAAAEGAAP--YTAPTRLeSF 1890
Cdd:pfam03154  316 AAPGQSQQRI------HTPPSQSQLQSQQP--PREQPLPPAPLSMPHIKPPPTTPIPQLPNPQSHKHPphLSGPSPF-QM 386
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1891 FSNCKPHP----------DAPLDTAPEPASVTTVAQVEALGPLESSFLDSSHSISALSQVEPVSWHEAFTSPEDDLDLGP 1960
Cdd:pfam03154  387 NSNLPPPPalkplsslstHHPPSAHPPPLQLMPQSQQLPPPPAQPPVLTQSQSLPPPAASHPPTSGLHQVPSQSPFPQHP 466
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1958747106 1961 FsLPELPLQAKDASDVEAETaeaSPAPPVESPPGPTGVLSGGDVPASTTEEPPAPPPQEASPQLSTEPE 2029
Cdd:pfam03154  467 F-VPGGPPPITPPSGPPTST---SSAMPGIQPPSSASVSSSGPVPAAVSCPLPPVQIKEEALDEAEEPE 531
ANK smart00248
ankyrin repeats; Ankyrin repeats are about 33 amino acids long and occur in at least four ...
73-100 1.50e-05

ankyrin repeats; Ankyrin repeats are about 33 amino acids long and occur in at least four consecutive copies. They are involved in protein-protein interactions. The core of the repeat seems to be a helix-loop-helix structure.


Pssm-ID: 197603 [Multi-domain]  Cd Length: 30  Bit Score: 43.73  E-value: 1.50e-05
                            10        20
                    ....*....|....*....|....*...
gi 1958747106    73 DDDTPLHDAANNGHYKVVKLLLRYGGNP 100
Cdd:smart00248    1 DGRTPLHLAAENGNLEVVKLLLDKGADI 28
SMC_N pfam02463
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ...
503-1103 1.56e-05

RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kinds of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.


Pssm-ID: 426784 [Multi-domain]  Cd Length: 1161  Bit Score: 50.74  E-value: 1.56e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106  503 EYEDSKQKPDKAILLENDVSTENKLKVLKH----------DRDKLSKTKSEDKEWLFKDEKALKRMKDVSKDTSRAFREE 572
Cdd:pfam02463  202 LKEQAKKALEYYQLKEKLELEEEYLLYLDYlklneeridlLQELLRDEQEEIESSKQEIEKEEEKLAQVLKENKEEEKEK 281
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106  573 rdrpSKTERERSTKEKSPKEEKLRLYKEERKKKSKDRPFKL-EKKNDMKEVSKEKEKAFREDKEKLKKEKLcRDDAAFDd 651
Cdd:pfam02463  282 ----KLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKEsEKEKKKAEKELKKEKEEIEELEKELKELE-IKREAEE- 355
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106  652 ycNKSQFLDHEDTKFSLSDDQQERWFSDLSDSSFDFKGEDSWDSVTDYRDIKSDSVAKLILETVKEDSKE--KKRDNKTR 729
Cdd:pfam02463  356 --EEEEELEKLQEKLEQLEEELLAKKKLESERLSSAAKLKEEELELKSEEEKEAQLLLELARQLEDLLKEekKEELEILE 433
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106  730 EKRDFRDSffrKRDRDCVDRNSEKRRDHTEKQRSFPSYLSEKDKKRRESAEGGRDRRDTLEGSRERRDGRIRSEEVHRED 809
Cdd:pfam02463  434 EEEESIEL---KQGKLTEEKEELEKQELKLLKDELELKKSEDLLKETQLVKLQEQLELLLSRQKLEERSQKESKARSGLK 510
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106  810 LKECGCDSTFKDKSDCDFTKTlepwERPHAAREKEK------------KDALEKDRKEKGRAEKYKDKSGERERNEKSIL 877
Cdd:pfam02463  511 VLLALIKDGVGGRIISAHGRL----GDLGVAVENYKvaistavivevsATADEVEERQKLVRALTELPLGARKLRLLIPK 586
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106  878 EKCQKDKEFEK-----CFKEKKDGKEKHKDTHSKDRKTPFDQLREKKEK--------------AFSSLISEDFSERKDDR 938
Cdd:pfam02463  587 LKLPLKSIAVLeidpiLNLAQLDKATLEADEDDKRAKVVEGILKDTELTklkesakakesglrKGVSLEEGLAEKSEVKA 666
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106  939 KGKEKSWYIADIftdESEDEKEECVASSFKTGETGDSQRAESLQEKEDGREHPSDRHRKASSDRQHTEKPRDKEPKEKRK 1018
Cdd:pfam02463  667 SLSELTKELLEI---QELQEKAESELAKEEILRRQLEIKKKEQREKEELKKLKLEAEELLADRVQEAQDKINEELKLLKQ 743
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1019 DRGAAEggkDKKEKIFEKHKEKKDKECAEKYKERKDRASVDSAPEKKNKQKLPEKVEKKHFVEDKAKSKHKEKPEKDHSR 1098
Cdd:pfam02463  744 KIDEEE---EEEEKSRLKKEEKEEEKSELSLKEKELAEEREKTEKLKVEEEKEEKLKAQEEELRALEEELKEEAELLEEE 820

                   ....*
gi 1958747106 1099 ERKSS 1103
Cdd:pfam02463  821 QLLIE 825
PBP1 COG5180
PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification]; ...
1895-2196 3.68e-04

PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification];


Pssm-ID: 444064 [Multi-domain]  Cd Length: 548  Bit Score: 45.82  E-value: 3.68e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1895 KPHPDAPLDTAPEPASVTTVAQVEALGPLESsfldsshsISALSQVEPVSWHEAFTSPEDDLDLGPFSLPELPLQAKDAS 1974
Cdd:COG5180     77 VAEPEAYLDPAPPKSSPDTPEEQLGAPAGDL--------LVLPAAKTPELAAGALPAPAAAAALPKAKVTREATSASAGV 148
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1975 DVEAetAEASPAPPVESPPGPTGVLSGGDVPASTTEEPPAPPPQEASPQLSTEPEPSEETKlDVVLEAAAETEVLADDSA 2054
Cdd:COG5180    149 ALAA--ALLQRSDPILAKDPDGDSASTLPPPAEKLDKVLTEPRDALKDSPEKLDRPKVEVK-DEAQEEPPDLTGGADHPR 225
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 2055 PEASISNLVPAPSPPEQQRPAGGGDEETEAEDPSAAPCCAPDGPTTDGLAQAPNSAEAACVVAAVEGPPGNIQPEATDpe 2134
Cdd:COG5180    226 PEAASSPKVDPPSTSEARSRPATVDAQPEMRPPADAKERRRAAIGDTPAAEPPGLPVLEAGSEPQSDAPEAETARPID-- 303
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1958747106 2135 PKPTSEAPKAPKVEEVPQRMTRNRAQM--LASQSKQGIPATEKD-----SMPAPASRAKGRAPEEEDAQ 2196
Cdd:COG5180    304 VKGVASAPPATRPVRPPGGARDPGTPRpgQPTERPAGVPEAASDagqppSAYPPAEEAVPGKPLEQGAP 372
U2AF_lg TIGR01642
U2 snRNP auxiliary factor, large subunit, splicing factor; These splicing factors consist of ...
716-802 1.21e-03

U2 snRNP auxiliary factor, large subunit, splicing factor; These splicing factors consist of an N-terminal arginine-rich low complexity domain followed by three tandem RNA recognition motifs (pfam00076). The well-characterized members of this family are auxiliary components of the U2 small nuclear ribonucleoprotein splicing factor (U2AF). These proteins are closely related to the CC1-like subfamily of splicing factors (TIGR01622). Members of this subfamily are found in plants, metazoa and fungi.


Pssm-ID: 273727 [Multi-domain]  Cd Length: 509  Bit Score: 44.11  E-value: 1.21e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106  716 KEDSKEKKRDnKTREKRDFRDSFFRKRDRDCV--DRNSEKRRDHTEKQRSFpsylsEKDKKRRESAEGGRDRRDTLEGSR 793
Cdd:TIGR01642   17 RDRSSERPRR-RSRDRSRFRDRHRRSRERSYRedSRPRDRRRYDSRSPRSL-----RYSSVRRSRDRPRRRSRSVRSIEQ 90

                   ....*....
gi 1958747106  794 ERRDGRIRS 802
Cdd:TIGR01642   91 HRRRLRDRS 99
 
ANKYR COG0666
Ankyrin repeat [Signal transduction mechanisms];
41-127 9.11e-24

Ankyrin repeat [Signal transduction mechanisms];


Pssm-ID: 440430 [Multi-domain]  Cd Length: 289  Bit Score: 103.88  E-value: 9.11e-24
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106   41 GWTALHEACNRGYYDIAKQLLAAGAEVNTKGLDDDTPLHDAANNGHYKVVKLLLRYGGNPQQSNRKGETPLKVA---NSP 117
Cdd:COG0666    120 GETPLHLAAYNGNLEIVKLLLEAGADVNAQDNDGNTPLHLAAANGNLEIVKLLLEAGADVNARDNDGETPLHLAaenGHL 199
                           90
                   ....*....|
gi 1958747106  118 TMVNLLLGKG 127
Cdd:COG0666    200 EIVKLLLEAG 209
ANKYR COG0666
Ankyrin repeat [Signal transduction mechanisms];
41-127 5.54e-22

Ankyrin repeat [Signal transduction mechanisms];


Pssm-ID: 440430 [Multi-domain]  Cd Length: 289  Bit Score: 98.87  E-value: 5.54e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106   41 GWTALHEACNRGYYDIAKQLLAAGAEVNTKGLDDDTPLHDAANNGHYKVVKLLLRYGGNPQQSNRKGETPLKVA---NSP 117
Cdd:COG0666     87 GNTLLHAAARNGDLEIVKLLLEAGADVNARDKDGETPLHLAAYNGNLEIVKLLLEAGADVNAQDNDGNTPLHLAaanGNL 166
                           90
                   ....*....|
gi 1958747106  118 TMVNLLLGKG 127
Cdd:COG0666    167 EIVKLLLEAG 176
ANKYR COG0666
Ankyrin repeat [Signal transduction mechanisms];
41-127 2.05e-21

Ankyrin repeat [Signal transduction mechanisms];


Pssm-ID: 440430 [Multi-domain]  Cd Length: 289  Bit Score: 96.95  E-value: 2.05e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106   41 GWTALHEACNRGYYDIAKQLLAAGAEVNTKGLDDDTPLHDAANNGHYKVVKLLLRYGGNPQQSNRKGETPLKVA---NSP 117
Cdd:COG0666    153 GNTPLHLAAANGNLEIVKLLLEAGADVNARDNDGETPLHLAAENGHLEIVKLLLEAGADVNAKDNDGKTALDLAaenGNL 232
                           90
                   ....*....|
gi 1958747106  118 TMVNLLLGKG 127
Cdd:COG0666    233 EIVKLLLEAG 242
PTZ00121 PTZ00121
MAEBL; Provisional
572-1474 1.90e-16

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 86.73  E-value: 1.90e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106  572 ERDRPSKTERERSTKEKSPKEEKlrlYKEERKKKSKDRPFKLEKkndMKEVSKEKEKAFREDKEKLKKEKLCRDDAAFDD 651
Cdd:PTZ00121  1070 EGLKPSYKDFDFDAKEDNRADEA---TEEAFGKAEEAKKTETGK---AEEARKAEEAKKKAEDARKAEEARKAEDARKAE 1143
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106  652 YCNKSQfldhEDTKFSLSDDQQERWFSDLSDSSFDFKGEDSWDSVTDYRdiKSDSVAKLILETVKEDSKEKKRDNKTREK 731
Cdd:PTZ00121  1144 EARKAE----DAKRVEIARKAEDARKAEEARKAEDAKKAEAARKAEEVR--KAEELRKAEDARKAEAARKAEEERKAEEA 1217
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106  732 RDFRDSffrkRDRDCVDRNSEKRRDHTEKQRSFPSYLSEKDKKRRESAEGGRDRRDTLEGSRERR--DGRIRSEEVHR-E 808
Cdd:PTZ00121  1218 RKAEDA----KKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARkaDELKKAEEKKKaD 1293
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106  809 DLKEcgcdSTFKDKSDcdftktlepwERPHAAREKEKKDALEKDRKE-KGRAEKYKDKSGERERNEKSILEKCQKDKEFE 887
Cdd:PTZ00121  1294 EAKK----AEEKKKAD----------EAKKKAEEAKKADEAKKKAEEaKKKADAAKKKAEEAKKAAEAAKAEAEAAADEA 1359
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106  888 KCFKEKKDGKEKHKDTHSKDRKTPFDQLREKKEKAFSSLISEDFSERKDDRKGKEKSWYIADiftdESEDEKEECVASSF 967
Cdd:PTZ00121  1360 EAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKAD----EAKKKAEEKKKADE 1435
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106  968 KTGETGDSQRAESLQEKEDGREHPSDRHRKASSDRQHTEKPRDKEPKEKRKD-RGAAEGGKDKKEKIFEKHKEKKDKECA 1046
Cdd:PTZ00121  1436 AKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEaKKKAEEAKKKADEAKKAAEAKKKADEA 1515
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1047 EKYKERKDRASVDSAPEKKNKQKLPEKVEKKHFVE-------DKAKSKHK-EKPEKDHSRERKSSRGPDVEKSLLEKLEE 1118
Cdd:PTZ00121  1516 KKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADElkkaeelKKAEEKKKaEEAKKAEEDKNMALRKAEEAKKAEEARIE 1595
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1119 EALHDYREDSNDKISEVSSDSFADHGQEpSLSTLLEVSFSEPPAEDKARESTCLSEKLKERERERHRHSSSSSKKSHERE 1198
Cdd:PTZ00121  1596 EVMKLYEEEKKMKAEEAKKAEEAKIKAE-ELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDK 1674
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1199 RAKKekadkkekgEDYKDGGGGRKDASQYEKDfadAEAFGGSYTTKADTEEDLDKAIELFSSEKKDRNDSER-----EPA 1273
Cdd:PTZ00121  1675 KKAE---------EAKKAEEDEKKAAEALKKE---AEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEakkeaEED 1742
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1274 KKLEKELKPYGSSTISILKEKKKREKHREKWREEKEKHRDKHIDGFLRHHKDEPKPAAKD-KDNPPNcFKEKSREESLKL 1352
Cdd:PTZ00121  1743 KKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVDKKIKDiFDNFAN-IIEGGKEGNLVI 1821
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1353 SEAKLKE--KFKE-----NAEREKGDSVKMSNGNDKPLPSRDANKKDSRPREKLLGDGDLmmtsfERMLSQKdlEIEERH 1425
Cdd:PTZ00121  1822 NDSKEMEdsAIKEvadskNMQLEEADAFEKHKFNKNNENGEDGNKEADFNKEKDLKEDDE-----EEIEEAD--EIEKID 1894
                          890       900       910       920
                   ....*....|....*....|....*....|....*....|....*....
gi 1958747106 1426 KRHKERMKQMEKMRHRSGDpKLKEKKPTDDGRKKSLDFPSKKALGLDKK 1474
Cdd:PTZ00121  1895 KDDIEREIPNNNMAGKNND-IIDDKLDKDEYIKRDAEETREEIIKISKK 1942
Ank_2 pfam12796
Ankyrin repeats (3 copies);
45-127 3.22e-16

Ankyrin repeats (3 copies);


Pssm-ID: 463710 [Multi-domain]  Cd Length: 91  Bit Score: 75.92  E-value: 3.22e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106   45 LHEACNRGYYDIAKQLLAAGAEVNTKGLDDDTPLHDAANNGHYKVVKLLLRYggNPQQSNRKGETPLKVA---NSPTMVN 121
Cdd:pfam12796    1 LHLAAKNGNLELVKLLLENGADANLQDKNGRTALHLAAKNGHLEIVKLLLEH--ADVNLKDNGRTALHYAarsGHLEIVK 78

                   ....*.
gi 1958747106  122 LLLGKG 127
Cdd:pfam12796   79 LLLEKG 84
ANKYR COG0666
Ankyrin repeat [Signal transduction mechanisms];
41-115 7.68e-15

Ankyrin repeat [Signal transduction mechanisms];


Pssm-ID: 440430 [Multi-domain]  Cd Length: 289  Bit Score: 77.69  E-value: 7.68e-15
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1958747106   41 GWTALHEACNRGYYDIAKQLLAAGAEVNTKGLDDDTPLHDAANNGHYKVVKLLLRYGGNPQQSNRKGETPLKVAN 115
Cdd:COG0666    186 GETPLHLAAENGHLEIVKLLLEAGADVNAKDNDGKTALDLAAENGNLEIVKLLLEAGADLNAKDKDGLTALLLAA 260
PTZ00121 PTZ00121
MAEBL; Provisional
498-1108 3.26e-14

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 79.41  E-value: 3.26e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106  498 KSFTYEYEDSKQKPDKAILLENDVSTENKLKVLKHDRDKLSKTKSEDKEWLFKDEKALKRMKDVSKDTSRAFREERDRPS 577
Cdd:PTZ00121  1105 KTETGKAEEARKAEEAKKKAEDARKAEEARKAEDARKAEEARKAEDAKRVEIARKAEDARKAEEARKAEDAKKAEAARKA 1184
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106  578 KTER---ERSTKEKSPKEEKLRLYKEERKKKSKDR---PFKLEKKNDMKEVSKEKEKAFREDKEKLKKEKLCRDDAAFDD 651
Cdd:PTZ00121  1185 EEVRkaeELRKAEDARKAEAARKAEEERKAEEARKaedAKKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAH 1264
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106  652 YCNKSQFLDHEDTKFS--LSDDQQERWFSDLSDSSFDFKGEDSWDSVTDYRdiKSDSVAKLILETVKEDSKEKKRDNKTR 729
Cdd:PTZ00121  1265 FARRQAAIKAEEARKAdeLKKAEEKKKADEAKKAEEKKKADEAKKKAEEAK--KADEAKKKAEEAKKKADAAKKKAEEAK 1342
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106  730 EKRDFRDSFFRKRDRDCvdRNSEKRRDHTEKQRSFPSYLSEKDKKRRESAEGGRDRRDTLEGSRERRDGRIRSEEVHR-- 807
Cdd:PTZ00121  1343 KAAEAAKAEAEAAADEA--EAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKka 1420
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106  808 EDLKECGCDSTFKDKSDCDFTKTLEPWERPHAAREKEKKDALEKDRKEKGRAEKYKDKSGERERNE---KSILEKCQKDK 884
Cdd:PTZ00121  1421 DEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADeakKKAEEAKKKAD 1500
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106  885 EFEKCFKEKKDGKEKHKDTHSK--------DRKTPFDQLR--EKKEKAFSSLISEDFSERKDDRKGKEKS-----WYIAD 949
Cdd:PTZ00121  1501 EAKKAAEAKKKADEAKKAEEAKkadeakkaEEAKKADEAKkaEEKKKADELKKAEELKKAEEKKKAEEAKkaeedKNMAL 1580
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106  950 IFTDESEDEKEECVASSFKTGETGDSQRAESLQEKEDGREHpSDRHRKASSDRQHTEKPRDKEPKEKRKD---RGAAEGG 1026
Cdd:PTZ00121  1581 RKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIK-AEELKKAEEEKKKVEQLKKKEAEEKKKAeelKKAEEEN 1659
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1027 KDKKEKIFEKhkekkdkecAEKYKERKDRASVDSAPEKKNKQKLPEKVEKKHFVEDKAKSKHKEKPEKDHSRERKSSRGP 1106
Cdd:PTZ00121  1660 KIKAAEEAKK---------AEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKI 1730

                   ..
gi 1958747106 1107 DV 1108
Cdd:PTZ00121  1731 KA 1732
PTZ00121 PTZ00121
MAEBL; Provisional
550-1104 1.11e-13

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 77.49  E-value: 1.11e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106  550 KDEKALKRMKDVSKDTSRAFREERDRPSKTERER--------STKEKSPKEEKLRLYKEERKKKSKDRPFKLEKKNDMKE 621
Cdd:PTZ00121  1224 KKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFeearmahfARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKK 1303
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106  622 VSKEKEKAFREDKEKLKKEKLCRDDAAFDDYCNKSQfldhEDTKFSLSDDQQERWFSDLSDSSFDFKGEDswDSVTDYRD 701
Cdd:PTZ00121  1304 ADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAE----EAKKAAEAAKAEAEAAADEAEAAEEKAEAA--EKKKEEAK 1377
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106  702 IKSDSVAKLILETVKEDSKEKKRDNKTREKRDFRDSFFRKRDRDCVDRNSEKRRDHTE-KQRSFPSYLSEKDKKRRESAE 780
Cdd:PTZ00121  1378 KKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEaKKKAEEAKKADEAKKKAEEAK 1457
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106  781 GGRDRRDTLEGSRERRDGRIRSEEVHR-EDLKECGCDStfKDKSDcDFTKTLEPWERPHAAREKEKKDALEKDRKekgrA 859
Cdd:PTZ00121  1458 KAEEAKKKAEEAKKADEAKKKAEEAKKaDEAKKKAEEA--KKKAD-EAKKAAEAKKKADEAKKAEEAKKADEAKK----A 1530
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106  860 EKYKDKSGERERNEKSILEKCQKDKEFEKCfKEKKDGKEKHKDTHSKD----RKTPFDQLREKKEKAFSSLISEDFSERK 935
Cdd:PTZ00121  1531 EEAKKADEAKKAEEKKKADELKKAEELKKA-EEKKKAEEAKKAEEDKNmalrKAEEAKKAEEARIEEVMKLYEEEKKMKA 1609
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106  936 DDRKGKEKSWYIADIFTDESEDEKEecvASSFKTGETGDSQRAESL-QEKEDGREHPSDRHRKASSDRQHTEKPRDKEPK 1014
Cdd:PTZ00121  1610 EEAKKAEEAKIKAEELKKAEEEKKK---VEQLKKKEAEEKKKAEELkKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEED 1686
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1015 EKRKDRGAAEGGKDKKEKIFEKHKEKKDKECAEKYKERKDRASVDSAPEKKNKQKLPEKVEK-KHFVEDKAKSKHKEKPE 1093
Cdd:PTZ00121  1687 EKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEaKKDEEEKKKIAHLKKEE 1766
                          570
                   ....*....|.
gi 1958747106 1094 KDHSRERKSSR 1104
Cdd:PTZ00121  1767 EKKAEEIRKEK 1777
PTZ00121 PTZ00121
MAEBL; Provisional
519-1380 2.27e-13

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 76.72  E-value: 2.27e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106  519 NDVSTENKLKVLKHDRDKLSKTKSEDKEWLFKDEkalkRMKDVSKDTSRAFREERDRPSKTERERSTKEKSPKEEKLRLY 598
Cdd:PTZ00121  1037 NNDDVLKEKDIIDEDIDGNHEGKAEAKAHVGQDE----GLKPSYKDFDFDAKEDNRADEATEEAFGKAEEAKKTETGKAE 1112
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106  599 KEERKKKSKDRP---FKLEKKNDMKEVSKEKEKAFREDKEKLKKEKLCRDDAAFDDYCNKSQFLDHEDTKFSLSDDQQE- 674
Cdd:PTZ00121  1113 EARKAEEAKKKAedaRKAEEARKAEDARKAEEARKAEDAKRVEIARKAEDARKAEEARKAEDAKKAEAARKAEEVRKAEe 1192
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106  675 -RWFSDL--SDSSFDFKGEDSWDSVTDYRDIKSDSVAKLIlETVKEDSKEKKRDNKTR---EKRDFRDSFFRKRDRDCVD 748
Cdd:PTZ00121  1193 lRKAEDArkAEAARKAEEERKAEEARKAEDAKKAEAVKKA-EEAKKDAEEAKKAEEERnneEIRKFEEARMAHFARRQAA 1271
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106  749 ------RNSEKRRDHTEKQRSFPSYLSEKDKKRRESAEGGRDRRDTLEGSRERRDGRIRSEEVHREDLKECGCDSTFKDK 822
Cdd:PTZ00121  1272 ikaeeaRKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAE 1351
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106  823 SDCDFTKTLEPWERPHAAR-----EKEKKDALEKDRKEKGRAEKYKDKSGERERNEKSILEKCQKDK---EFEKCFKEKK 894
Cdd:PTZ00121  1352 AEAAADEAEAAEEKAEAAEkkkeeAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKkadEAKKKAEEKK 1431
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106  895 DGKEKHKDTHSKDRKTPFDQLREKKEKAFSSLISEDFSERKDDRKGKEKSWYIADIFTDESEDEKEEcvASSFKTGETgD 974
Cdd:PTZ00121  1432 KADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKK--ADEAKKAAE-A 1508
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106  975 SQRAESLQEKEDGREhpSDRHRKASSDRQHTEKPRDKEPKEKRKDRGAAEGGKDKKEKifekhkekkDKECAEKYKERKD 1054
Cdd:PTZ00121  1509 KKKADEAKKAEEAKK--ADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKK---------KAEEAKKAEEDKN 1577
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1055 RASVDSAPEKKNKQKLPEKVEKKHFVEDKAKSKHKEKPEKDHSRERKSSRGPDVEKSLLEKLEEEALHDYREDSNDKISE 1134
Cdd:PTZ00121  1578 MALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEE 1657
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1135 VSSDSFADHGQ--EPSLSTLLEVSFSEppaEDKARESTCLSEKLKERERERHRHSSSSSKKSHERERAKKEKADKKEKGE 1212
Cdd:PTZ00121  1658 ENKIKAAEEAKkaEEDKKKAEEAKKAE---EDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEE 1734
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1213 DYKDGGGGRKDASQYEKDFADAEAFGGSYTTKADTEEDLDKAIELFSSEKKDRNDSEREpaKKLEKELKPYGSSTISILK 1292
Cdd:PTZ00121  1735 AKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRR--MEVDKKIKDIFDNFANIIE 1812
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1293 EKKKREKHREKWREeKEKHRDKHIDGFLRHHKDEPKPAAKDKDNPPNCFKEKSREESLKLSEAKLKEKFKENAerEKGDS 1372
Cdd:PTZ00121  1813 GGKEGNLVINDSKE-MEDSAIKEVADSKNMQLEEADAFEKHKFNKNNENGEDGNKEADFNKEKDLKEDDEEEI--EEADE 1889

                   ....*...
gi 1958747106 1373 VKMSNGND 1380
Cdd:PTZ00121  1890 IEKIDKDD 1897
ANKYR COG0666
Ankyrin repeat [Signal transduction mechanisms];
40-127 3.22e-12

Ankyrin repeat [Signal transduction mechanisms];


Pssm-ID: 440430 [Multi-domain]  Cd Length: 289  Bit Score: 69.60  E-value: 3.22e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106   40 AGWTALHEACNRGYYDIAKQLLAAGAEVNTKGLDDDTPLHDAANNGHYKVVKLLLRYGGNPQQSNRKGETPLKVA---NS 116
Cdd:COG0666     53 LGALLLLAAALAGDLLVALLLLAAGADINAKDDGGNTLLHAAARNGDLEIVKLLLEAGADVNARDKDGETPLHLAaynGN 132
                           90
                   ....*....|.
gi 1958747106  117 PTMVNLLLGKG 127
Cdd:COG0666    133 LEIVKLLLEAG 143
PHA03247 PHA03247
large tegument protein UL36; Provisional
1698-2209 3.33e-12

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 72.66  E-value: 3.33e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1698 PPPDSvfSNLPPKSSPSPrgELLTPAIEG-ALPPDL-----GLPLDATEDQQATAAILPPEPSylepldegpfntvitee 1771
Cdd:PHA03247  2502 GPPDP--DAPPAPSRLAP--AILPDEPVGePVHPRMltwirGLEELASDDAGDPPPPLPPAAP----------------- 2560
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1772 pvewtHSAAEQSLPSSLIASASETPVswpVGSELmlKSPQRFAESPKHFCPGEPlhsttPGPFSAAEPTYPVSPGSYPL- 1850
Cdd:PHA03247  2561 -----PAAPDRSVPPPRPAPRPSEPA---VTSRA--RRPDAPPQSARPRAPVDD-----RGDPRGPAPPSPLPPDTHAPd 2625
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1851 ---PAPEPALEEVKDGGTGAIPVAIAAAEGAAPYTAPTRLESFFSNCKPHPDAPLDTAPEPASVTTVAQVEALG---PLE 1924
Cdd:PHA03247  2626 pppPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLAdppPPP 2705
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1925 SSFLDSSHSISALSQVEPVSWHEAFTSPEDDLDLGPFSLPELPlqAKDASDVEAETAEASPAPPVESPP-GPTGVlsggd 2003
Cdd:PHA03247  2706 PTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGP--ATPGGPARPARPPTTAGPPAPAPPaAPAAG----- 2778
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 2004 vPASTTEEPPAPPPQEASPQLSTEPEPSEETKldVVLEAAAETEVLADDSAPEASISNLVPAPSPPEQQRP--------- 2074
Cdd:PHA03247  2779 -PPRRLTRPAVASLSESRESLPSPWDPADPPA--AVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPppslplggs 2855
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 2075 -AGGGDEETEAEDPSAAPC-CAPDGPTTDGLAQAPNSAEAACVVAAVEGPPGNIQPEA-----------TDPEPKPTSEA 2141
Cdd:PHA03247  2856 vAPGGDVRRRPPSRSPAAKpAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAppppqpqpqppPPPQPQPPPPP 2935
                          490       500       510       520       530       540
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1958747106 2142 PKAPKVEEVPQRMTRNRAQMLASQSKQGIPATEKDSMPAPASRAKGRAPEEEDAQAQHPRKRRFQRSG 2209
Cdd:PHA03247  2936 PPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSR 3003
Ank_2 pfam12796
Ankyrin repeats (3 copies);
41-104 5.03e-12

Ankyrin repeats (3 copies);


Pssm-ID: 463710 [Multi-domain]  Cd Length: 91  Bit Score: 63.98  E-value: 5.03e-12
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1958747106   41 GWTALHEACNRGYYDIAKQLLA-AGAEVNTKGlddDTPLHDAANNGHYKVVKLLLRYGGNPQQSN 104
Cdd:pfam12796   30 GRTALHLAAKNGHLEIVKLLLEhADVNLKDNG---RTALHYAARSGHLEIVKLLLEKGADINVKD 91
PTZ00121 PTZ00121
MAEBL; Provisional
672-1396 1.13e-11

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 70.94  E-value: 1.13e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106  672 QQERWFSDLSDSSFDFKGEDSWDSVTDYRDIKSDSVAKLILETVKEDSKEKKRDNKTREKRDFRDSffrkRDRDCVDRNS 751
Cdd:PTZ00121  1068 QDEGLKPSYKDFDFDAKEDNRADEATEEAFGKAEEAKKTETGKAEEARKAEEAKKKAEDARKAEEA----RKAEDARKAE 1143
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106  752 EKRRdhTEKQRSFPSYLSEKDKKRRESAEGGRDRRdTLEGSRERRDGRiRSEEVHR-EDLKECGCDSTFKDKSDCDFTKT 830
Cdd:PTZ00121  1144 EARK--AEDAKRVEIARKAEDARKAEEARKAEDAK-KAEAARKAEEVR-KAEELRKaEDARKAEAARKAEEERKAEEARK 1219
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106  831 LEPWERPHAAR--EKEKKDALEKDRKEKGRAEKYKDKSGERER------------NEKSILEKCQKDKEFEKCfKEKKDG 896
Cdd:PTZ00121  1220 AEDAKKAEAVKkaEEAKKDAEEAKKAEEERNNEEIRKFEEARMahfarrqaaikaEEARKADELKKAEEKKKA-DEAKKA 1298
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106  897 KEKHKDTHSKDRKTPFDQLREKKEKAFSSLISEDFSERKDDRKGK--EKSWYIADIFTDESEDEKEECVASSFKTGEtgD 974
Cdd:PTZ00121  1299 EEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKaaEAAKAEAEAAADEAEAAEEKAEAAEKKKEE--A 1376
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106  975 SQRAESLQEKEDGREHPSDRHRKASSDRQHTEKPRDKEPKEKRkdrgaAEGGKDKKEKIFEKHKEKKDKECAEKYKERKD 1054
Cdd:PTZ00121  1377 KKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKK-----ADEAKKKAEEKKKADEAKKKAEEAKKADEAKK 1451
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1055 RASvdsapEKKNKQKLPEKVEKKHfVEDKAKSKHKEKPEKDHSRERKSSRGPDVEKSLLEKLEEEALHDYRE-------D 1127
Cdd:PTZ00121  1452 KAE-----EAKKAEEAKKKAEEAK-KADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKaeeakkaD 1525
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1128 SNDKISEVSSDSFADHGQEPSLSTLLEVSFSEPPAEDKARESTCLSEKLKERERERHRHSSSSSKKSHERERAKKEKADK 1207
Cdd:PTZ00121  1526 EAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEK 1605
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1208 KEKGEDYKDGGGGRKDASQYEKdfaDAEAFGGSYTTKADTEEDLDKAIELFSSEKKDRNDSEREPAKKLEKELKPYGSST 1287
Cdd:PTZ00121  1606 KMKAEEAKKAEEAKIKAEELKK---AEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKK 1682
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1288 isilKEKKKREKHREKWREEKEKHRDKHIDGFLRHHK---DEPKPAAKDKDNPPNCFKEKSREESLKLSEAKLKEKFKEN 1364
Cdd:PTZ00121  1683 ----AEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKkkaEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKK 1758
                          730       740       750
                   ....*....|....*....|....*....|....*
gi 1958747106 1365 AEREKGDSVKMSNGNDKPLPS---RDANKKDSRPR 1396
Cdd:PTZ00121  1759 IAHLKKEEEKKAEEIRKEKEAvieEELDEEDEKRR 1793
PTZ00121 PTZ00121
MAEBL; Provisional
534-1175 1.02e-10

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 67.86  E-value: 1.02e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106  534 RDKLSKTKSEDKEWLFKDEKALKRMKDVSKDTSRAFREERDRPSKTERERSTKEKSPKEEKLRLYKEERKKKSKDRPFKL 613
Cdd:PTZ00121  1335 KKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAA 1414
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106  614 EKKNDMKEVSKEKEKAFREDKEKLKKEKLCRDDAAfddycnKSQFLDHEDTKFSLSDDQQERWFSDLSDSSFDFKGEDSW 693
Cdd:PTZ00121  1415 AAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEA------KKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEA 1488
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106  694 DSVTDYRDIKSDSVAKliletvKEDSKEKKRDNKTREKRDFRDSFFRKRDRdcvdRNSEKRRDHTEKQRSFPSYLSEKDK 773
Cdd:PTZ00121  1489 KKKAEEAKKKADEAKK------AAEAKKKADEAKKAEEAKKADEAKKAEEA----KKADEAKKAEEKKKADELKKAEELK 1558
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106  774 KRRESAEGGRDRRdtlegSRERRDGRIRSEEVHREDLKecgcdstfkdksdcdftKTLEPWERPHAAREKEKKDALEKDR 853
Cdd:PTZ00121  1559 KAEEKKKAEEAKK-----AEEDKNMALRKAEEAKKAEE-----------------ARIEEVMKLYEEEKKMKAEEAKKAE 1616
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106  854 KEKGRAEKYKDKSGERERNEKSILEKCQKDKEFEKCFKEKKDGKEKHKDTHSKDrktpfdqlREKKEKAfSSLISEDFSE 933
Cdd:PTZ00121  1617 EAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKA--------EEDKKKA-EEAKKAEEDE 1687
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106  934 RKDDRKGKEKSWYIADIFTDESEDEKEECVASSFKTGETGDSQRAESLQEKEDGREHPSDRHRKASSDR---QHTEKPRD 1010
Cdd:PTZ00121  1688 KKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKkkiAHLKKEEE 1767
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1011 KEPKEKRKDRGAA--EGGKDKKEKifekhkekKDKECAEKYKERKDRASVDSAPEKK-----NKQKLPEKVEKKHFVEDK 1083
Cdd:PTZ00121  1768 KKAEEIRKEKEAVieEELDEEDEK--------RRMEVDKKIKDIFDNFANIIEGGKEgnlviNDSKEMEDSAIKEVADSK 1839
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1084 AKSKHKEKPEKDHSRERKSSRGPDVEKSLLEKLEEEALHDYRE-----------DSNDKISEVSSDSFADHGQEPSLSTL 1152
Cdd:PTZ00121  1840 NMQLEEADAFEKHKFNKNNENGEDGNKEADFNKEKDLKEDDEEeieeadeiekiDKDDIEREIPNNNMAGKNNDIIDDKL 1919
                          650       660
                   ....*....|....*....|...
gi 1958747106 1153 LEVSFSEPPAEDKARESTCLSEK 1175
Cdd:PTZ00121  1920 DKDEYIKRDAEETREEIIKISKK 1942
PHA03247 PHA03247
large tegument protein UL36; Provisional
1634-2183 9.61e-10

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 64.57  E-value: 9.61e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1634 QSVPAASSFDSPVQHLLEEKAPLPPVPAEKFACLSPEYYSPDYGIPSPkvdtlhcpPTAVVSATPPPDsvfsnlPPKSSP 1713
Cdd:PHA03247  2566 RSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAP--------PSPLPPDTHAPD------PPPPSP 2631
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1714 SPRGELLTPAIEGALPPD-----------LGLPLDATEDQQATAAILPPE-------------------PSYLEPLDEGP 1763
Cdd:PHA03247  2632 SPAANEPDPHPPPTVPPPerprddpapgrVSRPRRARRLGRAAQASSPPQrprrraarptvgsltsladPPPPPPTPEPA 2711
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1764 FNTVITEEPVEWTHSAAEQSLPSSLIASASETPVSWPVGSelmlKSPQRFAESPkhfcpgeplhsTTPGPFSAAEPTYPV 1843
Cdd:PHA03247  2712 PHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATP----GGPARPARPP-----------TTAGPPAPAPPAAPA 2776
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1844 SPGSYPLPAPEPALEEVkdgGTGAIPVAIAAAEGAAPYTAPTRLEsffsnckPHPDAPLDTAPEPASVTTVAQVEALGPL 1923
Cdd:PHA03247  2777 AGPPRRLTRPAVASLSE---SRESLPSPWDPADPPAAVLAPAAAL-------PPAASPAGPLPPPTSAQPTAPPPPPGPP 2846
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1924 ESSfldsshsisalsqvepvswheaftspeddLDLGPFSLPELPLQAKDASDVEAETAEASPAPPVESPPGPTgvlsggd 2003
Cdd:PHA03247  2847 PPS-----------------------------LPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPA------- 2890
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 2004 VPASTTEEPPAPPPQEASPQLSTEPEPSEETKLDVVLEaaaetevladdsapeasisnlvPAPSPPEQQRPAgggdeete 2083
Cdd:PHA03247  2891 VSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQ----------------------PQPPPPPPPRPQ-------- 2940
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 2084 aedpsaapccAPDGPTTDGLAQAPNSAEAACVVAAVEGPPGNIQPEATDPEPKPTSEAPKAPkveevPQRMTRNRAQMLA 2163
Cdd:PHA03247  2941 ----------PPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASS-----TPPLTGHSLSRVS 3005
                          570       580
                   ....*....|....*....|
gi 1958747106 2164 SQSKQgiPATEKDSMPAPAS 2183
Cdd:PHA03247  3006 SWASS--LALHEETDPPPVS 3023
PHA03247 PHA03247
large tegument protein UL36; Provisional
1478-1995 1.55e-09

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 64.19  E-value: 1.55e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1478 PAP--VLPTGEGKPH-SGPGTESKDWLAGQPLKEVLPASPRTEQGRPTGVPTPTSVVSCPsyeevmHTPRTPSCSAddyp 1554
Cdd:PHA03247  2562 AAPdrSVPPPRPAPRpSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDT------HAPDPPPPSP---- 2631
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1555 dlvfdctdSQHSMPVSTTSTSACSPPFFDRFSVASSVV-----------AENAGQTPTRPistnlyrsisvdirRTPEEE 1623
Cdd:PHA03247  2632 --------SPAANEPDPHPPPTVPPPERPRDDPAPGRVsrprrarrlgrAAQASSPPQRP--------------RRRAAR 2689
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1624 FSVGdklfrqqsvPAASSFDSPVQHLLEEKAPLPPVPAEKfacLSPEYYSPDYGIPSPKVDTLhCPPTAVVSATP----- 1698
Cdd:PHA03247  2690 PTVG---------SLTSLADPPPPPPTPEPAPHALVSATP---LPPGPAAARQASPALPAAPA-PPAVPAGPATPggpar 2756
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1699 PPDSVFSNLPPKSSPsPRGELLTPaiegalPPDLGLPLDATEDQQATAAILPPEPSYLEPLDEGPFNTVITEEPVEWTHS 1778
Cdd:PHA03247  2757 PARPPTTAGPPAPAP-PAAPAAGP------PRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLP 2829
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1779 AAEQSLPSSLIASASETPVSWPVGSELMLKSPQRfAESPKHFCPGEPLHSTTPGPFSAAEPTYPVSPGSYPLPAPEPALE 1858
Cdd:PHA03247  2830 PPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVR-RRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERP 2908
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1859 EVKDGGTGAIPvaiAAAEGAAPYTAPTRLESFFSNCKPHPDAPLDTAPEPASVTTVAQVEALGPLE-------------- 1924
Cdd:PHA03247  2909 PQPQAPPPPQP---QPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRvavprfrvpqpaps 2985
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1925 ------SSFLDSSHSISALSQ------------VEPVSWHEAFTSPEDDLDLGPFSLPELPLQAKDASDVEAETAEasPA 1986
Cdd:PHA03247  2986 reapasSTPPLTGHSLSRVSSwasslalheetdPPPVSLKQTLWPPDDTEDSDADSLFDSDSERSDLEALDPLPPE--PH 3063

                   ....*....
gi 1958747106 1987 PPVESPPGP 1995
Cdd:PHA03247  3064 DPFAHEPDP 3072
Ank_4 pfam13637
Ankyrin repeats (many copies);
41-94 7.85e-09

Ankyrin repeats (many copies);


Pssm-ID: 372654 [Multi-domain]  Cd Length: 54  Bit Score: 53.43  E-value: 7.85e-09
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1958747106   41 GWTALHEACNRGYYDIAKQLLAAGAEVNTKGLDDDTPLHDAANNGHYKVVKLLL 94
Cdd:pfam13637    1 ELTALHAAAASGHLELLRLLLEKGADINAVDGNGETALHFAASNGNVEVLKLLL 54
PHA02878 PHA02878
ankyrin repeat protein; Provisional
43-128 1.41e-08

ankyrin repeat protein; Provisional


Pssm-ID: 222939 [Multi-domain]  Cd Length: 477  Bit Score: 59.89  E-value: 1.41e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106   43 TALHEACNRGYYDIAKQLLAAGAEVNTKGLDDDTPLHDAANNGHYKVVKLLLRYGGNPQQSNRKGETPLKVANSP----T 118
Cdd:PHA02878   170 TALHYATENKDQRLTELLLSYGANVNIPDKTNNSPLHHAVKHYNKPIVHILLENGASTDARDKCGNTPLHISVGYckdyD 249
                           90
                   ....*....|
gi 1958747106  119 MVNLLLGKGT 128
Cdd:PHA02878   250 ILKLLLEHGV 259
PTZ00322 PTZ00322
6-phosphofructo-2-kinase/fructose-2,6-biphosphatase; Provisional
57-125 2.44e-08

6-phosphofructo-2-kinase/fructose-2,6-biphosphatase; Provisional


Pssm-ID: 140343 [Multi-domain]  Cd Length: 664  Bit Score: 59.53  E-value: 2.44e-08
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1958747106   57 AKQLLAAGAEVNTKGLDDDTPLHDAANNGHYKVVKLLLRYGGNPQQSNRKGETPLKVA---NSPTMVNLLLG 125
Cdd:PTZ00322    98 ARILLTGGADPNCRDYDGRTPLHIACANGHVQVVRVLLEFGADPTLLDKDGKTPLELAeenGFREVVQLLSR 169
PHA03095 PHA03095
ankyrin-like protein; Provisional
41-127 2.61e-07

ankyrin-like protein; Provisional


Pssm-ID: 222980 [Multi-domain]  Cd Length: 471  Bit Score: 55.80  E-value: 2.61e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106   41 GWTALH-EACNRGYYDIAKQLLAAGAEVNTKGLDDDTPLHD-AAN-NGHYKVVKLLLRYGGNPQQSNRKGETPLKV---- 113
Cdd:PHA03095    83 GFTPLHlYLYNATTLDVIKLLIKAGADVNAKDKVGRTPLHVyLSGfNINPKVIRLLLRKGADVNALDLYGMTPLAVllks 162
                           90
                   ....*....|....*
gi 1958747106  114 AN-SPTMVNLLLGKG 127
Cdd:PHA03095   163 RNaNVELLRLLIDAG 177
PHA02874 PHA02874
ankyrin repeat protein; Provisional
45-114 2.86e-07

ankyrin repeat protein; Provisional


Pssm-ID: 165205 [Multi-domain]  Cd Length: 434  Bit Score: 55.74  E-value: 2.86e-07
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106   45 LHEACNRGYYDIAKQLLAAGAEVNTKGLDDDTPLHDAANNGHYKVVKLLLRYGGNPQQSNRKGETPLKVA 114
Cdd:PHA02874   161 IHIAIKHNFFDIIKLLLEKGAYANVKDNNGESPLHNAAEYGDYACIKLLIDHGNHIMNKCKNGFTPLHNA 230
Ank_5 pfam13857
Ankyrin repeats (many copies);
60-114 1.18e-06

Ankyrin repeats (many copies);


Pssm-ID: 433530 [Multi-domain]  Cd Length: 56  Bit Score: 47.34  E-value: 1.18e-06
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1958747106   60 LLAAG-AEVNTKGLDDDTPLHDAANNGHYKVVKLLLRYGGNPQQSNRKGETPLKVA 114
Cdd:pfam13857    1 LLEHGpIDLNRLDGEGYTPLHVAAKYGALEIVRVLLAYGVDLNLKDEEGLTALDLA 56
PHA03100 PHA03100
ankyrin repeat protein; Provisional
41-127 1.91e-06

ankyrin repeat protein; Provisional


Pssm-ID: 222984 [Multi-domain]  Cd Length: 422  Bit Score: 52.75  E-value: 1.91e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106   41 GWTALHEA--CNRGYYDIAKQLLAAGAEVN---------TKGLDDD-------TPLHDAANNGHYKVVKLLLRYGGNPQQ 102
Cdd:PHA03100   141 GENLLHLYleSNKIDLKILKLLIDKGVDINaknrvnyllSYGVPINikdvygfTPLHYAVYNNNPEFVKYLLDLGANPNL 220
                           90       100
                   ....*....|....*....|....*...
gi 1958747106  103 SNRKGETPLKVA---NSPTMVNLLLGKG 127
Cdd:PHA03100   221 VNKYGDTPLHIAilnNNKEIFKLLLNNG 248
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
1653-2029 2.23e-06

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 53.23  E-value: 2.23e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1653 KAPLPPVPAekfaCLSPEYYSPDYGIPSPKVDTLHCPPTAVVSATPPPDSVFSNLPPKSSPSPRGELLTPAIEGALPPDL 1732
Cdd:pfam03154  175 QAQSGAASP----PSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPHPPLQ 250
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1733 GLPLDATEDQQATAAIlpPEPSYLEPLDEGPFNtvITEEPVEWTHSAAEQSLPSSLIASASETPVSwpvgselmlksPQR 1812
Cdd:pfam03154  251 PMTQPPPPSQVSPQPL--PQPSLHGQMPPMPHS--LQTGPSHMQHPVPPQPFPLTPQSSQSQVPPG-----------PSP 315
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1813 FAESPKHFCPgeplhsTTPGPFSAAEPTYPvsPGSYPLPAPEPALEEVKDGGTGAIPVAIAAAEGAAP--YTAPTRLeSF 1890
Cdd:pfam03154  316 AAPGQSQQRI------HTPPSQSQLQSQQP--PREQPLPPAPLSMPHIKPPPTTPIPQLPNPQSHKHPphLSGPSPF-QM 386
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1891 FSNCKPHP----------DAPLDTAPEPASVTTVAQVEALGPLESSFLDSSHSISALSQVEPVSWHEAFTSPEDDLDLGP 1960
Cdd:pfam03154  387 NSNLPPPPalkplsslstHHPPSAHPPPLQLMPQSQQLPPPPAQPPVLTQSQSLPPPAASHPPTSGLHQVPSQSPFPQHP 466
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1958747106 1961 FsLPELPLQAKDASDVEAETaeaSPAPPVESPPGPTGVLSGGDVPASTTEEPPAPPPQEASPQLSTEPE 2029
Cdd:pfam03154  467 F-VPGGPPPITPPSGPPTST---SSAMPGIQPPSSASVSSSGPVPAAVSCPLPPVQIKEEALDEAEEPE 531
PHA03100 PHA03100
ankyrin repeat protein; Provisional
45-127 4.69e-06

ankyrin repeat protein; Provisional


Pssm-ID: 222984 [Multi-domain]  Cd Length: 422  Bit Score: 51.59  E-value: 4.69e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106   45 LHEACNRGYYDIAKQLLAAGAEVNTKGLDDDTPLHDAANNGHY-----KVVKLLLRYGGNPQQSNRKGETPLKVA----- 114
Cdd:PHA03100    39 LYLAKEARNIDVVKILLDNGADINSSTKNNSTPLHYLSNIKYNltdvkEIVKLLLEYGANVNAPDNNGITPLLYAiskks 118
                           90
                   ....*....|...
gi 1958747106  115 NSPTMVNLLLGKG 127
Cdd:PHA03100   119 NSYSIVEYLLDNG 131
PHA03100 PHA03100
ankyrin repeat protein; Provisional
41-99 5.29e-06

ankyrin repeat protein; Provisional


Pssm-ID: 222984 [Multi-domain]  Cd Length: 422  Bit Score: 51.59  E-value: 5.29e-06
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1958747106   41 GWTALHEACNRGYYDIAKQLLAAGAEVNTKGLDDDTPLHDAANNGHYKVVKLLLRYGGN 99
Cdd:PHA03100   192 GFTPLHYAVYNNNPEFVKYLLDLGANPNLVNKYGDTPLHIAILNNNKEIFKLLLNNGPS 250
PHA02878 PHA02878
ankyrin repeat protein; Provisional
55-129 5.42e-06

ankyrin repeat protein; Provisional


Pssm-ID: 222939 [Multi-domain]  Cd Length: 477  Bit Score: 51.42  E-value: 5.42e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106   55 DIAKQLLAAGAEVNTKGLD-DDTPLHDAANNGHYKVVKLLLRYGGNPQQSNRKGETPLKVA----NSPtMVNLLLGKGTY 129
Cdd:PHA02878   148 EITKLLLSYGADINMKDRHkGNTALHYATENKDQRLTELLLSYGANVNIPDKTNNSPLHHAvkhyNKP-IVHILLENGAS 226
PTZ00121 PTZ00121
MAEBL; Provisional
822-1477 6.71e-06

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 52.07  E-value: 6.71e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106  822 KSDCDFTKTLEPWERPHAAREKEKKDALEKDRKEKGRAE---KYKDKSGERERNEKSILEKCQKDKEFEKCFKEKKDGKE 898
Cdd:PTZ00121  1063 KAHVGQDEGLKPSYKDFDFDAKEDNRADEATEEAFGKAEeakKTETGKAEEARKAEEAKKKAEDARKAEEARKAEDARKA 1142
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106  899 KHKDTHSKDRKTPFDQLREKKEKAFSSLISEDFSERKDDRKGKE----KSWYIADIFTDESEDEKEECVASSFKTGETGD 974
Cdd:PTZ00121  1143 EEARKAEDAKRVEIARKAEDARKAEEARKAEDAKKAEAARKAEEvrkaEELRKAEDARKAEAARKAEEERKAEEARKAED 1222
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106  975 SQRAESLQEKEDGREHpSDRHRKASSDRQHTEKPRDKEPKEKRKDRGAAEGGKDKKEKIfEKHKEKKDKECAEKYKERKD 1054
Cdd:PTZ00121  1223 AKKAEAVKKAEEAKKD-AEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKA-DELKKAEEKKKADEAKKAEE 1300
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1055 RASVDS----APEKKNKQKLPEKVEKKHFVEDKAKSKHKEKPEKDHSRERKSSRGPDVEKSLLEKLEEEALHDYR----- 1125
Cdd:PTZ00121  1301 KKKADEakkkAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEakkka 1380
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1126 EDSNDKISEVSSDSFADHGQEPSLSTLLEVSFSEpPAEDKARESTCLSEKLKERERERHRHSSSSSKKSHERERAKKEKA 1205
Cdd:PTZ00121  1381 DAAKKKAEEKKKADEAKKKAEEDKKKADELKKAA-AAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKA 1459
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1206 DKKEKGEDYKDGGGGRKDASQYEKDFADAEAFGGSYTTKAD----TEEDLDKAIELFSSEKKDRNDSER--EPAKKLEKE 1279
Cdd:PTZ00121  1460 EEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADeakkAAEAKKKADEAKKAEEAKKADEAKkaEEAKKADEA 1539
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1280 LKPYGSSTISILKEKKKREKHREKWREEKEKHRDKHIDGFLRHHKDEPKPAAKDKDNPPNCFKEKSR---EESLKLSEAK 1356
Cdd:PTZ00121  1540 KKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKmkaEEAKKAEEAK 1619
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1357 LK--EKFKENAEREKGDSVKMSNGNDK----PLPSRDANKKDSRPREKLLGDGDLMMTSFERMLSQKDLEIEERHKRHKE 1430
Cdd:PTZ00121  1620 IKaeELKKAEEEKKKVEQLKKKEAEEKkkaeELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAE 1699
                          650       660       670       680
                   ....*....|....*....|....*....|....*....|....*....
gi 1958747106 1431 RMKQMEKMRHRSGDPKLK--EKKPTDDGRKKSLDFPSKKALGLDKKVKE 1477
Cdd:PTZ00121  1700 EAKKAEELKKKEAEEKKKaeELKKAEEENKIKAEEAKKEAEEDKKKAEE 1748
PHA03247 PHA03247
large tegument protein UL36; Provisional
1442-1787 7.94e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 51.86  E-value: 7.94e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1442 SGDPKLKEKKPTDDGRKKSLDFPSKKALGLDKKVKEPAPVLPTGEGKPhSGPGTESKDWLAGQPLKEVLPASPRTEQGRP 1521
Cdd:PHA03247  2698 LADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVP-AGPATPGGPARPARPPTTAGPPAPAPPAAPA 2776
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1522 TGVP---TPTSVVSCPSYEEVMHTPRTPSCSADDYPDLVFDCTDSQHSMPVSTTSTSAC-------SPPFFDRFSVASSV 1591
Cdd:PHA03247  2777 AGPPrrlTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQptappppPGPPPPSLPLGGSV 2856
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1592 V--AENAGQTPTRPISTNLYRSISVDIRRTPEEEFSVGDKLFRQ---------QSVPAASSFDSPVQHLLEEKAPLPPVP 1660
Cdd:PHA03247  2857 ApgGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALppdqperppQPQAPPPPQPQPQPPPPPQPQPPPPPP 2936
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1661 AEKFACLSPEYYSPDYGIPSPKVDTLH----CPPTAVVSATPPPDSVFSNLPPKSSPSPRGELLTPAI----------EG 1726
Cdd:PHA03247  2937 PRPQPPLAPTTDPAGAGEPSGAVPQPWlgalVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSRVsswasslalhEE 3016
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1958747106 1727 ALPPDLGL-----PLDATEDQQATAAILP-PEPSYLEPLD--EGPFNTVITEEPVEWTHSAAEQSLPSS 1787
Cdd:PHA03247  3017 TDPPPVSLkqtlwPPDDTEDSDADSLFDSdSERSDLEALDplPPEPHDPFAHEPDPATPEAGARESPSS 3085
ANKYR COG0666
Ankyrin repeat [Signal transduction mechanisms];
41-111 9.15e-06

Ankyrin repeat [Signal transduction mechanisms];


Pssm-ID: 440430 [Multi-domain]  Cd Length: 289  Bit Score: 49.95  E-value: 9.15e-06
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1958747106   41 GWTALHEACNRGYYDIAKQLLAAGAEVNTKGLDDDTPLHDAANNGHYKVVKLLLRYGGNPQQSNRKGETPL 111
Cdd:COG0666    219 GKTALDLAAENGNLEIVKLLLEAGADLNAKDKDGLTALLLAAAAGAALIVKLLLLALLLLAAALLDLLTLL 289
PHA03247 PHA03247
large tegument protein UL36; Provisional
1822-2193 1.19e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 51.09  E-value: 1.19e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1822 PGEPLHSttpgpfSAAEPTYPVSPGsyplPAPEPaleevkdgGTGAIPVAIAAAEGAAPytAPTRLesffsnckphPDAP 1901
Cdd:PHA03247  2475 PGAPVYR------RPAEARFPFAAG----AAPDP--------GGGGPPDPDAPPAPSRL--APAIL----------PDEP 2524
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1902 LDTAPEPASVTTVAQVEALG---------PLESSFLDSSHSISA-LSQVEPVSWHEAFTSPEDDLDLGPFSL-PELPLQA 1970
Cdd:PHA03247  2525 VGEPVHPRMLTWIRGLEELAsddagdpppPLPPAAPPAAPDRSVpPPRPAPRPSEPAVTSRARRPDAPPQSArPRAPVDD 2604
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1971 KDASDVEAETAEASPAPPVESPPGPTGVLSGGDVPASTTEEPPAPPPQEASPQLSTEPEPSEETKLDVVLEAAAETEVLA 2050
Cdd:PHA03247  2605 RGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPR 2684
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 2051 DDSAPE--ASISNLV--PAPSPPEQQRP--------------AGGGDEETEAEDPSA-APCCAPDGPTTDGLAQAPNSAE 2111
Cdd:PHA03247  2685 RRAARPtvGSLTSLAdpPPPPPTPEPAPhalvsatplppgpaAARQASPALPAAPAPpAVPAGPATPGGPARPARPPTTA 2764
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 2112 AACVVAAVEGPPGNIQPEATDPEPKPTSEA----PKAPKVEEVPQRMTRNRAQMLASQSKQGIPATEKDSMPAPASRAKG 2187
Cdd:PHA03247  2765 GPPAPAPPAAPAAGPPRRLTRPAVASLSESreslPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPG 2844

                   ....*.
gi 1958747106 2188 RAPEEE 2193
Cdd:PHA03247  2845 PPPPSL 2850
ANK smart00248
ankyrin repeats; Ankyrin repeats are about 33 amino acids long and occur in at least four ...
73-100 1.50e-05

ankyrin repeats; Ankyrin repeats are about 33 amino acids long and occur in at least four consecutive copies. They are involved in protein-protein interactions. The core of the repeat seems to be a helix-loop-helix structure.


Pssm-ID: 197603 [Multi-domain]  Cd Length: 30  Bit Score: 43.73  E-value: 1.50e-05
                            10        20
                    ....*....|....*....|....*...
gi 1958747106    73 DDDTPLHDAANNGHYKVVKLLLRYGGNP 100
Cdd:smart00248    1 DGRTPLHLAAENGNLEVVKLLLDKGADI 28
SMC_N pfam02463
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ...
503-1103 1.56e-05

RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kinds of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.


Pssm-ID: 426784 [Multi-domain]  Cd Length: 1161  Bit Score: 50.74  E-value: 1.56e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106  503 EYEDSKQKPDKAILLENDVSTENKLKVLKH----------DRDKLSKTKSEDKEWLFKDEKALKRMKDVSKDTSRAFREE 572
Cdd:pfam02463  202 LKEQAKKALEYYQLKEKLELEEEYLLYLDYlklneeridlLQELLRDEQEEIESSKQEIEKEEEKLAQVLKENKEEEKEK 281
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106  573 rdrpSKTERERSTKEKSPKEEKLRLYKEERKKKSKDRPFKL-EKKNDMKEVSKEKEKAFREDKEKLKKEKLcRDDAAFDd 651
Cdd:pfam02463  282 ----KLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKEsEKEKKKAEKELKKEKEEIEELEKELKELE-IKREAEE- 355
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106  652 ycNKSQFLDHEDTKFSLSDDQQERWFSDLSDSSFDFKGEDSWDSVTDYRDIKSDSVAKLILETVKEDSKE--KKRDNKTR 729
Cdd:pfam02463  356 --EEEEELEKLQEKLEQLEEELLAKKKLESERLSSAAKLKEEELELKSEEEKEAQLLLELARQLEDLLKEekKEELEILE 433
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106  730 EKRDFRDSffrKRDRDCVDRNSEKRRDHTEKQRSFPSYLSEKDKKRRESAEGGRDRRDTLEGSRERRDGRIRSEEVHRED 809
Cdd:pfam02463  434 EEEESIEL---KQGKLTEEKEELEKQELKLLKDELELKKSEDLLKETQLVKLQEQLELLLSRQKLEERSQKESKARSGLK 510
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106  810 LKECGCDSTFKDKSDCDFTKTlepwERPHAAREKEK------------KDALEKDRKEKGRAEKYKDKSGERERNEKSIL 877
Cdd:pfam02463  511 VLLALIKDGVGGRIISAHGRL----GDLGVAVENYKvaistavivevsATADEVEERQKLVRALTELPLGARKLRLLIPK 586
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106  878 EKCQKDKEFEK-----CFKEKKDGKEKHKDTHSKDRKTPFDQLREKKEK--------------AFSSLISEDFSERKDDR 938
Cdd:pfam02463  587 LKLPLKSIAVLeidpiLNLAQLDKATLEADEDDKRAKVVEGILKDTELTklkesakakesglrKGVSLEEGLAEKSEVKA 666
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106  939 KGKEKSWYIADIftdESEDEKEECVASSFKTGETGDSQRAESLQEKEDGREHPSDRHRKASSDRQHTEKPRDKEPKEKRK 1018
Cdd:pfam02463  667 SLSELTKELLEI---QELQEKAESELAKEEILRRQLEIKKKEQREKEELKKLKLEAEELLADRVQEAQDKINEELKLLKQ 743
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1019 DRGAAEggkDKKEKIFEKHKEKKDKECAEKYKERKDRASVDSAPEKKNKQKLPEKVEKKHFVEDKAKSKHKEKPEKDHSR 1098
Cdd:pfam02463  744 KIDEEE---EEEEKSRLKKEEKEEEKSELSLKEKELAEEREKTEKLKVEEEKEEKLKAQEEELRALEEELKEEAELLEEE 820

                   ....*
gi 1958747106 1099 ERKSS 1103
Cdd:pfam02463  821 QLLIE 825
PHA02874 PHA02874
ankyrin repeat protein; Provisional
55-129 6.28e-05

ankyrin repeat protein; Provisional


Pssm-ID: 165205 [Multi-domain]  Cd Length: 434  Bit Score: 48.04  E-value: 6.28e-05
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1958747106   55 DIAKQLLAAGAEVNTKGLDDDTPLHDAANNGHYKVVKLLLRYGGNPQQSNRKGETPLKVA---NSPTMVNLLLGKGTY 129
Cdd:PHA02874   105 DMIKTILDCGIDVNIKDAELKTFLHYAIKKGDLESIKMLFEYGADVNIEDDNGCYPIHIAikhNFFDIIKLLLEKGAY 182
PHA02876 PHA02876
ankyrin repeat protein; Provisional
43-127 9.83e-05

ankyrin repeat protein; Provisional


Pssm-ID: 165207 [Multi-domain]  Cd Length: 682  Bit Score: 47.75  E-value: 9.83e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106   43 TALHEA-CNRGYYDIAKQLLAAGAEVNTKGLDDDTPLHDAA-NNGHYKVVKLLLRYGGNPQQSNRKGETPLKVA-NSPTM 119
Cdd:PHA02876   410 TALHFAlCGTNPYMSVKTLIDRGANVNSKNKDLSTPLHYACkKNCKLDVIEMLLDNGADVNAINIQNQYPLLIAlEYHGI 489

                   ....*...
gi 1958747106  120 VNLLLGKG 127
Cdd:PHA02876   490 VNILLHYG 497
Ank pfam00023
Ankyrin repeat; Ankyrins are multifunctional adaptors that link specific proteins to the ...
73-105 1.00e-04

Ankyrin repeat; Ankyrins are multifunctional adaptors that link specific proteins to the membrane-associated, spectrin-actin cytoskeleton. This repeat-domain is a 'membrane-binding' domain of up to 24 repeated units, and it mediates most of the protein's binding activities. Repeats 13-24 are especially active, with known sites of interaction for the Na/K ATPase, Cl/HCO(3) anion exchanger, voltage-gated sodium channel, clathrin heavy chain and L1 family cell adhesion molecules. The ANK repeats are found to form a contiguous spiral stack such that ion transporters like the anion exchanger associate in a large central cavity formed by the ANK repeat spiral, while clathrin and cell adhesion molecules associate with specific regions outside this cavity.


Pssm-ID: 459634 [Multi-domain]  Cd Length: 34  Bit Score: 41.51  E-value: 1.00e-04
                           10        20        30
                   ....*....|....*....|....*....|....
gi 1958747106   73 DDDTPLHDAA-NNGHYKVVKLLLRYGGNPQQSNR 105
Cdd:pfam00023    1 DGNTPLHLAAgRRGNLEIVKLLLSKGADVNARDK 34
PHA02874 PHA02874
ankyrin repeat protein; Provisional
43-111 1.25e-04

ankyrin repeat protein; Provisional


Pssm-ID: 165205 [Multi-domain]  Cd Length: 434  Bit Score: 47.27  E-value: 1.25e-04
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1958747106   43 TALHEACNRGYYDIAKQLLAAGAEVNTKGLDDDTPLHDAANNGHYKVVKLLLRYGGNPQQSNRKGETPL 111
Cdd:PHA02874   126 TFLHYAIKKGDLESIKMLFEYGADVNIEDDNGCYPIHIAIKHNFFDIIKLLLEKGAYANVKDNNGESPL 194
DUF5401 pfam17380
Family of unknown function (DUF5401); This is a family of unknown function found in ...
759-921 1.26e-04

Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.


Pssm-ID: 375164 [Multi-domain]  Cd Length: 722  Bit Score: 47.43  E-value: 1.26e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106  759 EKQRSFPSYLSEKDKKRRESAEGGRDRRDTLEGSRERRDGRIRSEEVHREDLKECgcdstfkdksdcdftktlepwERPH 838
Cdd:pfam17380  410 ERQRKIQQQKVEMEQIRAEQEEARQREVRRLEEERAREMERVRLEEQERQQQVER---------------------LRQQ 468
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106  839 AAREKEKKDALEKDRKEKGRAEKYKDKSGERE--RNEKSILEKCQKDKEFEKCFKEKKDG-----------KEKHKDTHS 905
Cdd:pfam17380  469 EEERKRKKLELEKEKRDRKRAEEQRRKILEKEleERKQAMIEEERKRKLLEKEMEERQKAiyeeerrreaeEERRKQQEM 548
                          170
                   ....*....|....*.
gi 1958747106  906 KDRKTPFDQLREKKEK 921
Cdd:pfam17380  549 EERRRIQEQMRKATEE 564
Ank pfam00023
Ankyrin repeat; Ankyrins are multifunctional adaptors that link specific proteins to the ...
41-68 1.28e-04

Ankyrin repeat; Ankyrins are multifunctional adaptors that link specific proteins to the membrane-associated, spectrin-actin cytoskeleton. This repeat-domain is a 'membrane-binding' domain of up to 24 repeated units, and it mediates most of the protein's binding activities. Repeats 13-24 are especially active, with known sites of interaction for the Na/K ATPase, Cl/HCO(3) anion exchanger, voltage-gated sodium channel, clathrin heavy chain and L1 family cell adhesion molecules. The ANK repeats are found to form a contiguous spiral stack such that ion transporters like the anion exchanger associate in a large central cavity formed by the ANK repeat spiral, while clathrin and cell adhesion molecules associate with specific regions outside this cavity.


Pssm-ID: 459634 [Multi-domain]  Cd Length: 34  Bit Score: 41.12  E-value: 1.28e-04
                           10        20
                   ....*....|....*....|....*....
gi 1958747106   41 GWTALHEACNR-GYYDIAKQLLAAGAEVN 68
Cdd:pfam00023    2 GNTPLHLAAGRrGNLEIVKLLLSKGADVN 30
Ank_4 pfam13637
Ankyrin repeats (many copies);
76-124 1.29e-04

Ankyrin repeats (many copies);


Pssm-ID: 372654 [Multi-domain]  Cd Length: 54  Bit Score: 41.88  E-value: 1.29e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1958747106   76 TPLHDAANNGHYKVVKLLLRYGGNPQQSNRKGETPLKVA---NSPTMVNLLL 124
Cdd:pfam13637    3 TALHAAAASGHLELLRLLLEKGADINAVDGNGETALHFAasnGNVEVLKLLL 54
PHA03095 PHA03095
ankyrin-like protein; Provisional
55-127 1.34e-04

ankyrin-like protein; Provisional


Pssm-ID: 222980 [Multi-domain]  Cd Length: 471  Bit Score: 46.94  E-value: 1.34e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106   55 DIAKQLLAAGAEVNTKGLDDDTPLHDAANNGHYK---VVKLLLRYGGNPQQSNRKGETPLKV----ANSPTMVNLLLGKG 127
Cdd:PHA03095    28 EEVRRLLAAGADVNFRGEYGKTPLHLYLHYSSEKvkdIVRLLLEAGADVNAPERCGFTPLHLylynATTLDVIKLLIKAG 107
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
1976-2204 1.35e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 47.29  E-value: 1.35e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1976 VEAETAEASPAPPVESPPGPTGVLSGGDVPASTTEEPPAPPPQEASPQlsTEPEPSEETKLDVVLEAAAETEVLADDsap 2055
Cdd:PRK07764   584 VEAVVGPAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAG--AAAAPAEASAAPAPGVAAPEHHPKHVA--- 658
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 2056 easisnlVPAPSPPEQQRPAGGGDEETEAEDPSAAPcCAPDGPTTDGLAQAPNSAEAACVVAAVEGPPGNIQPEATDPEP 2135
Cdd:PRK07764   659 -------VPDASDGGDGWPAKAGGAAPAAPPPAPAP-AAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASA 730
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1958747106 2136 KPTSEAPKAPKVEEVPQRMTRNRAQMLASQSKQGIPATEKDSMPAPASRAKGRAPEEEDAQAQHPRKRR 2204
Cdd:PRK07764   731 PSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEMAEDDAPSMDDEDRR 799
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
1969-2208 2.06e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 46.77  E-value: 2.06e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1969 QAKDASDVEAETAEASPAPPVESPPGPTGVLSGGDVPASTTEEPPAPPPQEASPQLSTEPEPSEE-----TKLDVVLEAA 2043
Cdd:PRK07003   373 PARVAGAVPAPGARAAAAVGASAVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPPATADrgddaADGDAPVPAK 452
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 2044 AETEVLADDSAPEASISnlvPAPSPPEQQRPAGGGDEETEAEDPSAAPCCAPDGPTTDGLAQAPNSAEAACVVAAVEGPP 2123
Cdd:PRK07003   453 ANARASADSRCDERDAQ---PPADSGSASAPASDAPPDAAFEPAPRAAAPSAATPAAVPDARAPAAASREDAPAAAAPPA 529
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 2124 gniqPEATDPEPKPTSEAPKAP--------------KVEEVPQRMTRNRAQMLASQSKQGIPATEKDSMPAPASRAKGRA 2189
Cdd:PRK07003   530 ----PEARPPTPAAAAPAARAGgaaaaldvlrnagmRVSSDRGARAAAAAKPAAAPAAAPKPAAPRVAVQVPTPRARAAT 605
                          250
                   ....*....|....*....
gi 1958747106 2190 PEEEDAQAQHPRKRRFQRS 2208
Cdd:PRK07003   606 GDAPPNGAARAEQAAESRG 624
PRK10263 PRK10263
DNA translocase FtsK; Provisional
1967-2190 2.74e-04

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 46.62  E-value: 2.74e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1967 PLQAKDASDVEAETAEASPAPPVESPPGPTG-------VLSGGDVPASTTEEPPAPPPQEASPQLSTEPEPSEE------ 2033
Cdd:PRK10263   319 PVAVAAAATTATQSWAAPVEPVTQTPPVASVdvppaqpTVAWQPVPGPQTGEPVIAPAPEGYPQQSQYAQPAVQyneplq 398
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 2034 ---TKLDVVLEAAAETEVLADDSAPEASISNLVPAPSPPEQQRPAGGGDEETEAEDPSA-APCCAPDGPTTDGLAQAPns 2109
Cdd:PRK10263   399 qpvQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTFApQSTYQTEQTYQQPAAQEP-- 476
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 2110 aeaacvvaAVEGPPGNIQPEATDPEPKPTSEAPKAPKV---EEVPQRMTRNRAQMLASQskQGIPATEKDSMPAPASRAK 2186
Cdd:PRK10263   477 --------LYQQPQPVEQQPVVEPEPVVEETKPARPPLyyfEEVEEKRAREREQLAAWY--QPIPEPVKEPEPIKSSLKA 546

                   ....
gi 1958747106 2187 GRAP 2190
Cdd:PRK10263   547 PSVA 550
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in Chordata.
1828-2100 2.81e-04

Family of unknown function (DUF5585); This is a family of unknown function found in Chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 46.11  E-value: 2.81e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1828 STTPGPFSAAEPTYPVSPGSYPLPAPEpalEEVKDGGTGAIPVAIAAAEGAAPYTAPTRLESFFSNCKPHPDAPLDTAPE 1907
Cdd:pfam17823  120 SSSPSSAAQSLPAAIAALPSEAFSAPR---AAACRANASAAPRAAIAAASAPHAASPAPRTAASSTTAASSTTAASSAPT 196
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1908 PASVTTVAQVEALGPLESSFLDSSHSI--SALSQVePVSWHEAFTSPEDDLDLGPFSLPELPLQAKD------------- 1972
Cdd:pfam17823  197 TAASSAPATLTPARGISTAATATGHPAagTALAAV-GNSSPAAGTVTAAVGTVTPAALATLAAAAGTvasaagtinmgdp 275
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1973 -------ASDVEAETAEASPAPPVESPP-GPTGVLSGgDVPASTTEEPPAPPPQEASPQLSTePEPSEETKLDVVLEAAA 2044
Cdd:pfam17823  276 harrlspAKHMPSDTMARNPAAPMGAQAqGPIIQVST-DQPVHNTAGEPTPSPSNTTLEPNT-PKSVASTNLAVVTTTKA 353
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1958747106 2045 ETE--------VLADDSAPE--ASISNLVPAPSPPEQQRPAGGGDEETEAEDPSAAPCCAPDGPTT 2100
Cdd:pfam17823  354 QAKepsaspvpVLHTSMIPEveATSPTTQPSPLLPTQGAAGPGILLAPEQVATEATAGTASAGPTP 419
ANK smart00248
ankyrin repeats; Ankyrin repeats are about 33 amino acids long and occur in at least four ...
41-68 3.40e-04

ankyrin repeats; Ankyrin repeats are about 33 amino acids long and occur in at least four consecutive copies. They are involved in protein-protein interactions. The core of the repeat seems to be a helix-loop-helix structure.


Pssm-ID: 197603 [Multi-domain]  Cd Length: 30  Bit Score: 39.88  E-value: 3.40e-04
                            10        20
                    ....*....|....*....|....*...
gi 1958747106    41 GWTALHEACNRGYYDIAKQLLAAGAEVN 68
Cdd:smart00248    2 GRTPLHLAAENGNLEVVKLLLDKGADIN 29
PBP1 COG5180
PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification]; ...
1895-2196 3.68e-04

PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification];


Pssm-ID: 444064 [Multi-domain]  Cd Length: 548  Bit Score: 45.82  E-value: 3.68e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1895 KPHPDAPLDTAPEPASVTTVAQVEALGPLESsfldsshsISALSQVEPVSWHEAFTSPEDDLDLGPFSLPELPLQAKDAS 1974
Cdd:COG5180     77 VAEPEAYLDPAPPKSSPDTPEEQLGAPAGDL--------LVLPAAKTPELAAGALPAPAAAAALPKAKVTREATSASAGV 148
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1975 DVEAetAEASPAPPVESPPGPTGVLSGGDVPASTTEEPPAPPPQEASPQLSTEPEPSEETKlDVVLEAAAETEVLADDSA 2054
Cdd:COG5180    149 ALAA--ALLQRSDPILAKDPDGDSASTLPPPAEKLDKVLTEPRDALKDSPEKLDRPKVEVK-DEAQEEPPDLTGGADHPR 225
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 2055 PEASISNLVPAPSPPEQQRPAGGGDEETEAEDPSAAPCCAPDGPTTDGLAQAPNSAEAACVVAAVEGPPGNIQPEATDpe 2134
Cdd:COG5180    226 PEAASSPKVDPPSTSEARSRPATVDAQPEMRPPADAKERRRAAIGDTPAAEPPGLPVLEAGSEPQSDAPEAETARPID-- 303
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1958747106 2135 PKPTSEAPKAPKVEEVPQRMTRNRAQM--LASQSKQGIPATEKD-----SMPAPASRAKGRAPEEEDAQ 2196
Cdd:COG5180    304 VKGVASAPPATRPVRPPGGARDPGTPRpgQPTERPAGVPEAASDagqppSAYPPAEEAVPGKPLEQGAP 372
PHA02878 PHA02878
ankyrin repeat protein; Provisional
41-114 6.66e-04

ankyrin repeat protein; Provisional


Pssm-ID: 222939 [Multi-domain]  Cd Length: 477  Bit Score: 44.87  E-value: 6.66e-04
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1958747106   41 GWTALHEACNRGY-YDIAKQLLAAGAEVNTK----GLdddTPLHDAANNGhyKVVKLLLRYGGNPQQSNRKGETPLKVA 114
Cdd:PHA02878   234 GNTPLHISVGYCKdYDILKLLLEHGVDVNAKsyilGL---TALHSSIKSE--RKLKLLLEYGADINSLNSYKLTPLSSA 307
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
1993-2209 6.94e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 44.87  E-value: 6.94e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1993 PGPTGvlsGGDVPASTTEEPPAPPPQEASPQLSTEPEPSeetklDVVLEAAAETEVLADDSAPEASISNLVPAPS--PPE 2070
Cdd:PRK12323   365 PGQSG---GGAGPATAAAAPVAQPAPAAAAPAAAAPAPA-----APPAAPAAAPAAAAAARAVAAAPARRSPAPEalAAA 436
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 2071 QQRPAGGGDEETEaedPSAAPCCAPDGPTTDGLAQAPNSAEAACVVAAVEGPPGniQPEATDPEPKPTSEAPKAPKVEEV 2150
Cdd:PRK12323   437 RQASARGPGGAPA---PAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAA--APAPADDDPPPWEELPPEFASPAP 511
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1958747106 2151 PQRMTRNRAQMLASQSKQGI--PATEKDSMPAPASRAKGRAPEEEDAQAQHPRKRRFQRSG 2209
Cdd:PRK12323   512 AQPDAAPAGWVAESIPDPATadPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRASASG 572
PTZ00322 PTZ00322
6-phosphofructo-2-kinase/fructose-2,6-biphosphatase; Provisional
39-96 7.82e-04

6-phosphofructo-2-kinase/fructose-2,6-biphosphatase; Provisional


Pssm-ID: 140343 [Multi-domain]  Cd Length: 664  Bit Score: 44.89  E-value: 7.82e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1958747106   39 FAGWTALHEACNRGYYDIAKQLLAAGAEVNTKGLDDDTPLHDAANNGHYKVVKLLLRY 96
Cdd:PTZ00322   113 YDGRTPLHIACANGHVQVVRVLLEFGADPTLLDKDGKTPLELAEENGFREVVQLLSRH 170
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
1969-2184 7.95e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 44.87  E-value: 7.95e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1969 QAKDASDVEAETAEASPAPPVESPPGPTGVLSGGDVPASTTEEPPAPPPQEASPQLSTEPEPSEETKLDVVLEAAAETev 2048
Cdd:PRK12323   371 GAGPATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAP-- 448
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 2049 lADDSAPEASISNLVPAPSPPEQQRPAGGGDEETEAEdPSAAPCCAPDG-PTTDGLAQAPNSAEAACVVAAVEGPPGNIQ 2127
Cdd:PRK12323   449 -APAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAA-PAAAPAPADDDpPPWEELPPEFASPAPAQPDAAPAGWVAESI 526
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1958747106 2128 PEATDPEPKPT----SEAPKAPKVEEVPQRMTRNRAQMLASQSKQGIPATEKDSMPAPASR 2184
Cdd:PRK12323   527 PDPATADPDDAfetlAPAPAAAPAPRAAAATEPVVAPRPPRASASGLPDMFDGDWPALAAR 587
PRK10263 PRK10263
DNA translocase FtsK; Provisional
1691-2222 9.67e-04

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 44.69  E-value: 9.67e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1691 TAVVSATPPPDSVfsnLPPKSSPSPRGELLTPAIE--GALPPDLGLPL--DATEDQQATAAILPPEPSYLEPLDEGPFNT 1766
Cdd:PRK10263   328 TATQSWAAPVEPV---TQTPPVASVDVPPAQPTVAwqPVPGPQTGEPViaPAPEGYPQQSQYAQPAVQYNEPLQQPVQPQ 404
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1767 VITEEPVEWTHSAAEQSLPSSLIASASETPVswPVGSELMLKSPQRFAESPKHFCPGEPLHSTTPGPFSAAEPTYPVSPG 1846
Cdd:PRK10263   405 QPYYAPAAEQPAQQPYYAPAPEQPAQQPYYA--PAPEQPVAGNAWQAEEQQSTFAPQSTYQTEQTYQQPAAQEPLYQQPQ 482
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1847 SYPLPA---PEPALEEVKDGGTgaiPVAIAAAEGAAPYTAPTRLESFFSNC-----KPHPDAPLDTAPEPASVTTVAQVE 1918
Cdd:PRK10263   483 PVEQQPvvePEPVVEETKPARP---PLYYFEEVEEKRAREREQLAAWYQPIpepvkEPEPIKSSLKAPSVAAVPPVEAAA 559
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1919 ALGPLESSFLD-SSHSISALSQVEPVSWHEAFTSPEDDLDLGPFslPELPLQAKDASDVEAETAEASPAPPVESPPGPTG 1997
Cdd:PRK10263   560 AVSPLASGVKKaTLATGAAATVAAPVFSLANSGGPRPQVKEGIG--PQLPRPKRIRVPTRRELASYGIKLPSQRAAEEKA 637
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1998 VLSGGDVPASTTEEPPAPPPQEASPQLSTEPEPSEETKLDVVLEAAAETEVLADDSAPEASISNLVPAPsppEQQRPAGg 2077
Cdd:PRK10263   638 REAQRNQYDSGDQYNDDEIDAMQQDELARQFAQTQQQRYGEQYQHDVPVNAEDADAAAEAELARQFAQT---QQQRYSG- 713
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 2078 gdeeteaEDPSAA-PCCAPD---GPTTDGLAQAPNsaeaacvvaavegppgniqpeatDPEPKPTSEAPKAPKVEEVPQR 2153
Cdd:PRK10263   714 -------EQPAGAnPFSLDDfefSPMKALLDDGPH-----------------------EPLFTPIVEPVQQPQQPVAPQQ 763
                          490       500       510       520       530       540
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1958747106 2154 MTRNRAQMLASQSKQGIPATEKDSMPAPASRAKGRAPEEEDAQAQHPRKRRFQRSGQQLQQQLNTSTQQ 2222
Cdd:PRK10263   764 QYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQ 832
PHA02876 PHA02876
ankyrin repeat protein; Provisional
56-99 1.20e-03

ankyrin repeat protein; Provisional


Pssm-ID: 165207 [Multi-domain]  Cd Length: 682  Bit Score: 44.28  E-value: 1.20e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....
gi 1958747106   56 IAKQLLAAGAEVNTKGLDDDTPLHDAANNGHYKVVKLLLRYGGN 99
Cdd:PHA02876   160 IAEMLLEGGADVNAKDIYCITPIHYAAERGNAKMVNLLLSYGAD 203
U2AF_lg TIGR01642
U2 snRNP auxiliary factor, large subunit, splicing factor; These splicing factors consist of ...
716-802 1.21e-03

U2 snRNP auxiliary factor, large subunit, splicing factor; These splicing factors consist of an N-terminal arginine-rich low complexity domain followed by three tandem RNA recognition motifs (pfam00076). The well-characterized members of this family are auxiliary components of the U2 small nuclear ribonucleoprotein splicing factor (U2AF). These proteins are closely related to the CC1-like subfamily of splicing factors (TIGR01622). Members of this subfamily are found in plants, metazoa and fungi.


Pssm-ID: 273727 [Multi-domain]  Cd Length: 509  Bit Score: 44.11  E-value: 1.21e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106  716 KEDSKEKKRDnKTREKRDFRDSFFRKRDRDCV--DRNSEKRRDHTEKQRSFpsylsEKDKKRRESAEGGRDRRDTLEGSR 793
Cdd:TIGR01642   17 RDRSSERPRR-RSRDRSRFRDRHRRSRERSYRedSRPRDRRRYDSRSPRSL-----RYSSVRRSRDRPRRRSRSVRSIEQ 90

                   ....*....
gi 1958747106  794 ERRDGRIRS 802
Cdd:TIGR01642   91 HRRRLRDRS 99
PRK13108 PRK13108
prolipoprotein diacylglyceryl transferase; Reviewed
1973-2107 1.25e-03

prolipoprotein diacylglyceryl transferase; Reviewed


Pssm-ID: 237284 [Multi-domain]  Cd Length: 460  Bit Score: 43.81  E-value: 1.25e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1973 ASDVEAETAEASPAPPVESPPGPTGVLSGGDVPASTTEEPPAPPPQEASPQL-STEPEPSEETKLDVVLEAAAETEVLAD 2051
Cdd:PRK13108   305 AAAVASAASAVGPVGPGEPNQPDDVAEAVKAEVAEVTDEVAAESVVQVADRDgESTPAVEETSEADIEREQPGDLAGQAP 384
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1958747106 2052 dSAPEASISNLVPAPSPPEQQRPAGGGDEETEAEDPSA--APCCAPDGPTTDGLAQAP 2107
Cdd:PRK13108   385 -AAHQVDAEAASAAPEEPAALASEAHDETEPEVPEKAApiPDPAKPDELAVAGPGDDP 441
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
495-943 1.50e-03

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 43.90  E-value: 1.50e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106  495 LKLKSFTYEYEDSKQKPDKaiLLENdvsTENKLKVLKHDRDKLSKtKSEDKEWLFKDEKALKRMKDVSKDTSRAFreERD 574
Cdd:PRK03918   296 IKLSEFYEEYLDELREIEK--RLSR---LEEEINGIEERIKELEE-KEERLEELKKKLKELEKRLEELEERHELY--EEA 367
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106  575 RPSKTERERSTKEKSPK-----EEKLRLYKEERKKKSKDRPFKLEKKNDMKEVSKEKEKAFREDKEKLKKEKLCRDDaaf 649
Cdd:PRK03918   368 KAKKEELERLKKRLTGLtpeklEKELEELEKAKEEIEEEISKITARIGELKKEIKELKKAIEELKKAKGKCPVCGRE--- 444
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106  650 ddycnksqfLDHEDTKfslsdDQQERWFSDLSDSSFDFKGEDSwdsvtDYRDIKSDSVAkliLETVKEDSKEKKRDNKTR 729
Cdd:PRK03918   445 ---------LTEEHRK-----ELLEEYTAELKRIEKELKEIEE-----KERKLRKELRE---LEKVLKKESELIKLKELA 502
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106  730 EKRDFRDSFFRKRDRDCVDRNSEKRRDHTEKQRSFPSYLSEKdKKRRESAEGGRDRRDTLEgsRERRDGRIRSEEVHREd 809
Cdd:PRK03918   503 EQLKELEEKLKKYNLEELEKKAEEYEKLKEKLIKLKGEIKSL-KKELEKLEELKKKLAELE--KKLDELEEELAELLKE- 578
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106  810 LKECGcdstFKDKSDCDFT-KTLEPWER--------PHAAREKEKKDALEKDRKEKGRAE-KYKDKSGERERNEKSILEK 879
Cdd:PRK03918   579 LEELG----FESVEELEERlKELEPFYNeylelkdaEKELEREEKELKKLEEELDKAFEElAETEKRLEELRKELEELEK 654
                          410       420       430       440       450       460
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1958747106  880 CQKDKEFEKCFKEKKDGKEKHKDTHSKdrktpFDQLREKKEKAFSSLisEDFSERKDDRKGKEK 943
Cdd:PRK03918   655 KYSEEEYEELREEYLELSRELAGLRAE-----LEELEKRREEIKKTL--EKLKEELEEREKAKK 711
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
1779-2204 1.86e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 43.62  E-value: 1.86e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1779 AAEQSLPSsLIASASETPVSWPVGSELMLKSPQRfaespkhfcpgeplHSTTPGPFSAAEPTYPVSPGSYPLPAPepale 1858
Cdd:PHA03307    16 EGGEFFPR-PPATPGDAADDLLSGSQGQLVSDSA--------------ELAAVTVVAGAAACDRFEPPTGPPPGP----- 75
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1859 evkdggtGAIPVAIAAAEGAAPYTAPTRLESFFSNCKPHPDAPLDTAPEPASVTTVAQVEALGPLESSFLDSSHSISALS 1938
Cdd:PHA03307    76 -------GTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPP 148
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1939 QVEPVSWHEAFTSPEDDldlgpfslPELPLQAKDASDVEAETAEASPAPPVESP-----------PGPTGVLSGGDVPAS 2007
Cdd:PHA03307   149 AASPPAAGASPAAVASD--------AASSRQAALPLSSPEETARAPSSPPAEPPpstppaaasprPPRRSSPISASASSP 220
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 2008 TTEEPPAPPPQEASPQLSTEPEPSEETKLDVVLEAAAETEVLADDSAPEASISNLVPAPSPPEQQRPAGGGDEETEAEDP 2087
Cdd:PHA03307   221 APAPGRSAADDAGASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSP 300
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 2088 SAAPCCAPDGPTTDGLAQAPN-SAEAACVVAAVEGPPGNIQPEATDPEPKPTSEAPKAPkvEEVPQRMTRNRAQMLASQS 2166
Cdd:PHA03307   301 SSPGSGPAPSSPRASSSSSSSrESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPP--ADPSSPRKRPRPSRAPSSP 378
                          410       420       430
                   ....*....|....*....|....*....|....*...
gi 1958747106 2167 kqgiPATEKDSMPAPASRAKGRAPEEEDAQAQHPRKRR 2204
Cdd:PHA03307   379 ----AASAGRPTRRRARAAVAGRARRRDATGRFPAGRP 412
PRK14959 PRK14959
DNA polymerase III subunits gamma and tau; Provisional
2062-2192 2.50e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 184923 [Multi-domain]  Cd Length: 624  Bit Score: 43.13  E-value: 2.50e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 2062 LVPAPSPPEQQRPAGGGDEETEAED-PSAAPCCAPDGPTTDglAQAPNSAEAACVVAAVEGPPGNIQPEATdPEPK-PTS 2139
Cdd:PRK14959   361 MLPRLMPVESLRPSGGGASAPSGSAaEGPASGGAATIPTPG--TQGPQGTAPAAGMTPSSAAPATPAPSAA-PSPRvPWD 437
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1958747106 2140 EAPKAPKVEEVPqrmTRNRAQMLASQSKQGIPATEKDSMPAPASRAKGRAPEE 2192
Cdd:PRK14959   438 DAPPAPPRSGIP---PRPAPRMPEASPVPGAPDSVASASDAPPTLGDPSDTAE 487
PHA03379 PHA03379
EBNA-3A; Provisional
1750-2173 2.95e-03

EBNA-3A; Provisional


Pssm-ID: 223066 [Multi-domain]  Cd Length: 935  Bit Score: 43.12  E-value: 2.95e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1750 PPEPSYLEPldegpfntvitEEPVEWTHSAAEQSLPSSLIASASETPVSWPVgselmlkspQRFAESPKHfcpgePLHST 1829
Cdd:PHA03379   408 ASEPTYGTP-----------RPPVEKPRPEVPQSLETATSHGSAQVPEPPPV---------HDLEPGPLH-----DQHSM 462
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1830 TPGPFSAAEPT--YPVSPGSYpLPAPepaleeVKDGGTGAIPVAIAAAEGAAPY-TAPTRLesffSNCKPHPDAPLDTAP 1906
Cdd:PHA03379   463 APCPVAQLPPGplQDLEPGDQ-LPGV------VQDGRPACAPVPAPAGPIVRPWeASLSQV----PGVAFAPVMPQPMPV 531
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1907 EPASVTTVAQVEALGPLESSFL-----DSSHSISALSQVEPVSWheaftSPEDDLDLGPFSLPELP------LQAKDASd 1975
Cdd:PHA03379   532 EPVPVPTVALERPVCPAPPLIAmqgpgETSGIVRVRERWRPAPW-----TPNPPRSPSQMSVRDRLarlraeAQPYQAS- 605
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1976 VEAETAE---ASPAPPVESPPGP-------------TGVLSGGDVPAS-------TTEEPPAPPPQEASPQLSTEPEP-- 2030
Cdd:PHA03379   606 VEVQPPQltqVSPQQPMEYPLEPeqqmfpgspfsqvADVMRAGGVPAMqpqyfdlPLQQPISQGAPLAPLRASMGPVPpv 685
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 2031 --SEETKLDVVLEAAAETEVLADDSAPEAsisnlvPAPSPPEQQRPAGGGDEETEAEDPSAAPCCAPDGPTTDGLAQ-AP 2107
Cdd:PHA03379   686 paTQPQYFDIPLTEPINQGASAAHFLPQQ------PMEGPLVPERWMFQGATLSQSVRPGVAQSQYFDLPLTQPINHgAP 759
                          410       420       430       440       450       460
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1958747106 2108 nsaeaacvvaAVEGPPGNIQPEATDPEPKPTSEAPKAPKVEEVPQRMTRNRaQMLASQSKQGIPAT 2173
Cdd:PHA03379   760 ----------AAHFLHQPPMEGPWVPEQWMFQGAPPSQGTDVVQHQLDALG-YVLHVLNHPGVPVS 814
PHA03247 PHA03247
large tegument protein UL36; Provisional
1732-2198 3.20e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 43.00  E-value: 3.20e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1732 LGLPLDATedqQATAAILPPEPSYLEPLDE-GPFNTVITEEPVewthsaaeqslpssliASASETPVSWPVGSELmlkSP 1810
Cdd:PHA03247  2460 LGAPFSLS---LLLGELFPGAPVYRRPAEArFPFAAGAAPDPG----------------GGGPPDPDAPPAPSRL---AP 2517
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1811 QRFAESPkhfcPGEPLH-----------------STTPGPFSAAEPTYPVSPGSYPLPAPEPALEEVKDGGTGAIPvaIA 1873
Cdd:PHA03247  2518 AILPDEP----VGEPVHprmltwirgleelasddAGDPPPPLPPAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRP--DA 2591
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1874 AAEGAAPYTAPTRLESFFSNCKPHPDAPLDTAPEPASVTTVAQVEALGPLESSFLDSSHSISALSQVEPVSWHEAFTSPE 1953
Cdd:PHA03247  2592 PPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLG 2671
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1954 DDLdlGPFSLPELPLQAKDASDVEAETAEASPAPPvESPPGPtgvlsggdvpastteeppapPPQEASPQLSTEPEP--- 2030
Cdd:PHA03247  2672 RAA--QASSPPQRPRRRAARPTVGSLTSLADPPPP-PPTPEP--------------------APHALVSATPLPPGPaaa 2728
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 2031 ---SEETKLDVVLEAAAETEVL----------ADDSAPEASISNLVPA----------------------PSPPEQQRPA 2075
Cdd:PHA03247  2729 rqaSPALPAAPAPPAVPAGPATpggparparpPTTAGPPAPAPPAAPAagpprrltrpavaslsesreslPSPWDPADPP 2808
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 2076 GGGDEETEAEDPSAAPcCAPDGPTTDGLAQAPNSAEAACVVAAVEG----PPGNIQPEATDPEPKPTSEAPKAPKVEEVP 2151
Cdd:PHA03247  2809 AAVLAPAAALPPAASP-AGPLPPPTSAQPTAPPPPPGPPPPSLPLGgsvaPGGDVRRRPPSRSPAAKPAAPARPPVRRLA 2887
                          490       500       510       520
                   ....*....|....*....|....*....|....*....|....*..
gi 1958747106 2152 qrmtrnRAQMLASQSKQGIPATEKDSMPAPASRAKGRAPEEEDAQAQ 2198
Cdd:PHA03247  2888 ------RPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQ 2928
sbcc TIGR00618
exonuclease SbcC; All proteins in this family for which functions are known are part of an ...
761-1092 3.32e-03

exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 129705 [Multi-domain]  Cd Length: 1042  Bit Score: 43.03  E-value: 3.32e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106  761 QRSFPSYLSEKDKKRRESAEG-------------GRDRRDTLEGSRERRDGRIRSEEVHREDLKECGCDS-TFKDKSDCD 826
Cdd:TIGR00618  151 QGEFAQFLKAKSKEKKELLMNlfpldqytqlalmEFAKKKSLHGKAELLTLRSQLLTLCTPCMPDTYHERkQVLEKELKH 230
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106  827 FTKTLEPWERPHAAREKEKKDALEKDRKEkgRAEKYKDKSGERERNEKSILEKCQKDKEFeKCFKEKKDGKEKHKDTHSK 906
Cdd:TIGR00618  231 LREALQQTQQSHAYLTQKREAQEEQLKKQ--QLLKQLRARIEELRAQEAVLEETQERINR-ARKAAPLAAHIKAVTQIEQ 307
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106  907 DRKTPFDQLREKKEKaFSSLISEDFSERKDDRKGKEKSWYIADIFTDESEDEKEECVASSFK---TGETGDSQRAESLQE 983
Cdd:TIGR00618  308 QAQRIHTELQSKMRS-RAKLLMKRAAHVKQQSSIEEQRRLLQTLHSQEIHIRDAHEVATSIReisCQQHTLTQHIHTLQQ 386
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106  984 KedgREHPSDRHRKASS------DRQHTEKPRDKEPKEKRKDRGAAEGG-KDKKEKIFEKHKEKKDKECAEKYKERKDRA 1056
Cdd:TIGR00618  387 Q---KTTLTQKLQSLCKeldilqREQATIDTRTSAFRDLQGQLAHAKKQqELQQRYAELCAAAITCTAQCEKLEKIHLQE 463
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|
gi 1958747106 1057 SVDSAPEK----KNKQKLPEKVEKKHFVEDKAKSKHKEKP 1092
Cdd:TIGR00618  464 SAQSLKEReqqlQTKEQIHLQETRKKAVVLARLLELQEEP 503
Ank_3 pfam13606
Ankyrin repeat; Ankyrins are multifunctional adaptors that link specific proteins to the ...
41-69 3.89e-03

Ankyrin repeat; Ankyrins are multifunctional adaptors that link specific proteins to the membrane-associated, spectrin-actin cytoskeleton. This repeat-domain is a 'membrane-binding' domain of up to 24 repeated units, and it mediates most of the protein's binding activities.


Pssm-ID: 463933 [Multi-domain]  Cd Length: 30  Bit Score: 36.85  E-value: 3.89e-03
                           10        20
                   ....*....|....*....|....*....
gi 1958747106   41 GWTALHEACNRGYYDIAKQLLAAGAEVNT 69
Cdd:pfam13606    2 GNTPLHLAARNGRLEIVKLLLENGADINA 30
U2AF_lg TIGR01642
U2 snRNP auxiliary factor, large subunit, splicing factor; These splicing factors consist of ...
720-811 5.44e-03

U2 snRNP auxiliary factor, large subunit, splicing factor; These splicing factors consist of an N-terminal arginine-rich low complexity domain followed by three tandem RNA recognition motifs (pfam00076). The well-characterized members of this family are auxiliary components of the U2 small nuclear ribonucleoprotein splicing factor (U2AF). These proteins are closely related to the CC1-like subfamily of splicing factors (TIGR01622). Members of this subfamily are found in plants, metazoa and fungi.


Pssm-ID: 273727 [Multi-domain]  Cd Length: 509  Bit Score: 41.80  E-value: 5.44e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106  720 KEKKRDNKTREKRDFRDSFFRKRDRDcVDRNSEKRRDHTEKQRSFPSYLSEKDKKRRESAEGGRDRRDTLEGSRERRDGR 799
Cdd:TIGR01642    3 EEPDREREKSRGRDRDRSSERPRRRS-RDRSRFRDRHRRSRERSYREDSRPRDRRRYDSRSPRSLRYSSVRRSRDRPRRR 81
                           90
                   ....*....|....*
gi 1958747106  800 ---IRSEEVHREDLK 811
Cdd:TIGR01642   82 srsVRSIEQHRRRLR 96
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
1673-2008 6.25e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 42.08  E-value: 6.25e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1673 SPDYGIPSPKVDTLHCPPTAVVSATPPPDSVF--SNLPPKSSP--SPRGELLTPAIEGALPPDLGLPLDATEDQQATAAI 1748
Cdd:PHA03307    34 DLLSGSQGQLVSDSAELAAVTVVAGAAACDRFepPTGPPPGPGteAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSS 113
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1749 LPPEPSYLEPLDEGPFNTVITEEPVEWTHSAAEQSLPSSLIASASETPVSWPVGSELMLKSPQRFAESPKHfCPGEPLHS 1828
Cdd:PHA03307   114 PDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEETAR-APSSPPAE 192
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1829 ---TTPGPFSAAEPTYPVSPGSYPLPAPEPALEEVKDGGTGAIPVAIAAAEGAAPYTAPTrlesffsNCKPHPDAPLDTA 1905
Cdd:PHA03307   193 pppSTPPAAASPRPPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPE-------NECPLPRPAPITL 265
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1906 PEPASVTTVAQVEALGPLESSfldSSHSISALSqvepvswheafTSPEDDLDLGPFSLPELPLQAKDASDVEAETAEASP 1985
Cdd:PHA03307   266 PTRIWEASGWNGPSSRPGPAS---SSSSPRERS-----------PSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSS 331
                          330       340
                   ....*....|....*....|...
gi 1958747106 1986 APPVESPPGPTGVLSGGDVPAST 2008
Cdd:PHA03307   332 SSESSRGAAVSPGPSPSRSPSPS 354
PHA02876 PHA02876
ankyrin repeat protein; Provisional
43-127 6.61e-03

ankyrin repeat protein; Provisional


Pssm-ID: 165207 [Multi-domain]  Cd Length: 682  Bit Score: 41.97  E-value: 6.61e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106   43 TALHEACNRGYY-DIAKQLLAAGAEVNTKGLDDDTPLHDAANNGHYKVVKLLLRYGGNPQQSNRKGETPLKVA----NSP 117
Cdd:PHA02876   343 TPLHQASTLDRNkDIVITLLELGANVNARDYCDKTPIHYAAVRNNVVIINTLLDYGADIEALSQKIGTALHFAlcgtNPY 422
                           90
                   ....*....|
gi 1958747106  118 TMVNLLLGKG 127
Cdd:PHA02876   423 MSVKTLIDRG 432
PLN03192 PLN03192
Voltage-dependent potassium channel; Provisional
41-127 6.72e-03

Voltage-dependent potassium channel; Provisional


Pssm-ID: 215625 [Multi-domain]  Cd Length: 823  Bit Score: 41.78  E-value: 6.72e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106   41 GWTALHEACNRGYYDIAKQLLAAGAEVNTKGLDDDTPLHDAANNGHYKV------------------------------- 89
Cdd:PLN03192   558 GRTPLHIAASKGYEDCVLVLLKHACNVHIRDANGNTALWNAISAKHHKIfrilyhfasisdphaagdllctaakrndlta 637
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|.
gi 1958747106   90 VKLLLRYGGNPQQSNRKGETPLKVA---NSPTMVNLLLGKG 127
Cdd:PLN03192   638 MKELLKQGLNVDSEDHQGATALQVAmaeDHVDMVRLLIMNG 678
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
1960-2166 6.91e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 41.79  E-value: 6.91e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 1960 PFSLPELPLQAKDASDVEAETAEASPAPPVESPPGPTGVLSGGDVPASTTEEPPAPPPQEASPQLSTEPEPSEETKLDVV 2039
Cdd:PRK12323   403 PAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPAR 482
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958747106 2040 LEAAAETEVLADDSAPEASISNLVPAPSpPEQQRPAGGGDEETEAEDPSAAPCCAPDGPTTDGLAQAPNSAEAACVVAAV 2119
Cdd:PRK12323   483 AAPAAAPAPADDDPPPWEELPPEFASPA-PAQPDAAPAGWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVV 561
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*..
gi 1958747106 2120 EGPPgniqPEATDPEPKPTSEAPKAPKVEEVPqrmTRNRAQMLASQS 2166
Cdd:PRK12323   562 APRP----PRASASGLPDMFDGDWPALAARLP---VRGLAQQLARQS 601
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
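
The per-hit summary lines above (the ones beginning with "Pssm-ID:") can be read programmatically and checked against the E-value threshold listed in the preset options. The following is a minimal sketch in Python, not part of the CD-Search output. The names `SUMMARY_RE`, `parse_summary`, and `passes_threshold` are hypothetical, and treating the threshold as inclusive is an assumption.

```python
import re

# Pattern for summary lines of the form shown in this report, e.g.
# "Pssm-ID: 184923 [Multi-domain]  Cd Length: 624  Bit Score: 43.13  E-value: 2.50e-03"
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*\[(?P<kind>[^\]]+)\]\s*"
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s*"
    r"E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return the fields of one summary line as a dict, or None if it does not match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m["pssm_id"]),
        "kind": m["kind"],
        "cd_length": int(m["cd_length"]),
        "bit_score": float(m["bit_score"]),
        "e_value": float(m["e_value"]),
    }


def passes_threshold(hit, threshold=0.01):
    """Assumption: a hit is kept when its E-value is at or below the threshold."""
    return hit["e_value"] <= threshold


if __name__ == "__main__":
    line = "Pssm-ID: 184923 [Multi-domain]  Cd Length: 624  Bit Score: 43.13  E-value: 2.50e-03"
    hit = parse_summary(line)
    print(hit, passes_threshold(hit))
```

Every hit listed in this section has an E-value below 0.01, which is consistent with the threshold in the preset options.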

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D):384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D):265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D):200-3.