NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1237937751|ref|NP_001341835|]
View 

adenomatous polyposis coli protein isoform m [Homo sapiens]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
Arm_APC_u3 pfam16629
Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately ...
449-736 9.76e-166

Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately downstream of the armadillo fold before the beta-catenin binding motifs, APC_crr, pfam05923, on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.


:

Pssm-ID: 435476  Cd Length: 293  Bit Score: 512.21  E-value: 9.76e-166
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751  449 NRPAKYKDANIMSPGSSLPSLHVRKQKALEAELDAQHLSETFDNIDNLSPKASHRSKQRHKQSLYGDYVFDTNRHDDN-- 526
Cdd:pfam16629    1 NRPAKYKDANIMSPGSSLPSLHVRKQKALEAELDAQHLSETFDNIDNLSPKASHRNKQRHKQNVYSEYVLDSGRHDDSvc 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751  527 RSDNFNTGNMTVLSPYLNTTVLPSSSSSRG--SLDSSRSEKDRSLERERGIGLGNYHPATENPGTSSKR-GLQISTTAAQ 603
Cdd:pfam16629   81 RSDNFNTGNVTVLSPYLNTTVLPSSSSRDSrgNAESSRSEKDRSLDRERGAGLSNFHPATENSGNSSKRiGMQISTTAAQ 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751  604 IAKVMEEVSAIHTSQEDRSSGSTTELHCVTDERNALRRSSAAHTHSNTYNFTKSENSNRTCSMPYAKLEYKRSSNDSLNS 683
Cdd:pfam16629  161 IAKVMEEVSSMHISQEDRSSGSTSDMHCMQDDRNSIRRSSTAHPHSNVYSFNKSESSNRPCPMPYMKMEYKRASNDSLNS 240
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1237937751  684 VSSSDGYGKRGQMKPSIESYSEDDESKFCSYGQYPADLAHKIHSANHMDDNDG 736
Cdd:pfam16629  241 VSSSDGYGKRGQMKPSVESYSEDDEGKFCSYGKYPADLAHKIHSANHMDDNDG 293
APC_basic pfam05956
APC basic domain; This region of the APC family of proteins is known as the basic domain. It ...
1940-2285 5.72e-108

APC basic domain; This region of the APC family of proteins is known as the basic domain. It contains a high proportion of positively charged amino acids and interacts with microtubules.


:

Pssm-ID: 428690 [Multi-domain]  Cd Length: 336  Bit Score: 348.79  E-value: 5.72e-108
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 1940 SISRGRTMIHIPGVRNSSSSTSPVSKKGPPLKTPASKSPSEGQTATTS-PRGAKPSVKSELSPVARQTSQIGGSSKAPSR 2018
Cdd:pfam05956    1 VVFRGRTVIYMPGVKESQPSTSPPPKKTPPKTDAPAKNPNLGQQRSRSlHRLGKPSELADLSPPKRSATPPARISKAPSS 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 2019 SGSRDSTPSRPAQQPLSRPIQSPGRNSISPGRNgisppnKLSQLPRTSSPSTASTKSSGSgKMSYTSPGRQMSQQNLTKQ 2098
Cdd:pfam05956   81 GSSRDSTPSRPPQKKLTSPSQSPGRLPGSGGRN------KLSPLPKTKSPARASTKKSGS-HKTQKSPVRIPFMQTPTKQ 153
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 2099 TGLSKNASSI------PRSESASKGLNqmnngNGANKKVELSRMSSTKSSGSESDRSerpVLVRQSTFIKEAPSPTLRRK 2172
Cdd:pfam05956  154 TGLPRNPSPLvtnqpePRSESASKGLR-----SLPGKRLDLVRMSSARSSGSESDRS---GFLRQLTFIKESPSLLLRRR 225
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 2173 LEESASfESLSPSSRPASPTRSQaqtpvlsPSLPDMSLSTHSSVQAGGWRKLPPNLSPTIEYNDGRPAKRHDIARSHSES 2252
Cdd:pfam05956  226 LELSAS-ESLSPSSQPASPRRSR-------PGLPAVFLCSSRCQELKGWRKQPPNPNSRAEPSDRPLTRRRPPRRTSSES 297
                          330       340       350
                   ....*....|....*....|....*....|...
gi 1237937751 2253 PSRLPInRSGTWKREHSKHSSSLPRVSTWRRTG 2285
Cdd:pfam05956  298 PSRLPV-RNGTWKRETFKRYSSLPHINVWRRTG 329
EB1_binding pfam05937
EB-1 Binding Domain; This region, found at the C-terminus of the APC proteins, binds the ...
2387-2560 2.07e-88

EB-1 Binding Domain; This region, found at the C-terminus of the APC proteins, binds the microtubule-associating protein EB-1. At the C-terminus of the alignment is also a pfam00595 binding domain. A short motif in the middle of the region appears to be found in the APC2 proteins.


:

Pssm-ID: 399141  Cd Length: 174  Bit Score: 285.74  E-value: 2.07e-88
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 2387 RSGRSPTGNTPPVIDSVSEKANPNIKDSKDNQAKQNVGNGSVPMRTVGLENRLNSFIQVDAPDQKGTEIKPGQNNPVPVS 2466
Cdd:pfam05937    1 RSGRSPTGNTPPVIDSVPEKGIKDEKDSKDPQAKQNMGNGNVPVRTVGLENRLNSFIQSDSPDKKGTETKPLQNNPVPTP 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 2467 ETNESSIVERTPFSSSSSSKHSSPSGTVAARVTPFNYNPSPRKSSADSTSARPSQIPTPVNNNTKKRDSKTDSTESSGTQ 2546
Cdd:pfam05937   81 ETNENPVSERTPFSSSSSSKHSSPSGAVAARVTPFNYNPSPRKSSADSSSARPSQIPTPVNNSTKKRDSKTESTDSSGNQ 160
                          170
                   ....*....|....
gi 1237937751 2547 SPKRHSGSYLVTSV 2560
Cdd:pfam05937  161 SPKRHSGSYLVTSV 174
APC_u5 pfam16630
Unstructured region on APC between 1st and 2nd catenin-bdg motifs; APC_u5 is a short region of ...
753-852 1.17e-56

Unstructured region on APC between 1st and 2nd catenin-bdg motifs; APC_u5 is a short region of natively unstructured sequence lying between the first and the second 15-residue beta-catenin binding motifs, APC_15aa, pfam05972, on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.


:

Pssm-ID: 406923  Cd Length: 100  Bit Score: 191.66  E-value: 1.17e-56
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751  753 LNSGRQSPSQNERWARPKHIIEDEIKQSEQRQSRNQSTTYPVYTESTDDKHLKFQPHFGQQECVSPYRSRGANGSETNRV 832
Cdd:pfam16630    1 LNSGRQSPSQNERWARPKHIIEDEMKQSEQRQPRSQSTTYPVYTESGDDKHMKFQPRFGQQECVSPFRSRGSNGSEQSRV 80
                           90       100
                   ....*....|....*....|
gi 1237937751  833 GSNHGINQNVSQSLCQEDDY 852
Cdd:pfam16630   81 GSSHGINQKVSQSLCQVDDY 100
APC_u14 pfam16635
Unstructured region on APC between SAMP and APC_crr; APC_u14 is a short region of natively ...
1463-1556 1.48e-46

Unstructured region on APC between SAMP and APC_crr; APC_u14 is a short region of natively unstructured sequence lying between the second SAMP pfam05924, and the fifth creatine-rich region, APC_crr, pfam05923, on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.


:

Pssm-ID: 435479  Cd Length: 94  Bit Score: 162.70  E-value: 1.48e-46
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 1463 IMDQVQQASASSSAPNKNQLDGKKKKPTSPVKPIPQNTEYRTRVRKNADSKNNLNAERVFSDNKDSKKQNLKNNSKVFND 1542
Cdd:pfam16635    1 IMDQIQQASAASSGGSKSQQDGEKKKPTSPVKPMPQSSEYRARVRKNTESKNNLNSERSYPDNKESKKQNLKNNSRDFND 80
                           90
                   ....*....|....
gi 1237937751 1543 KLPNNEDRVRGSFA 1556
Cdd:pfam16635   81 KLPNNEERTRGSFA 94
APC_rep pfam18797
Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo ...
110-183 3.39e-45

Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo repeat and uses its highly conserved surface groove to recognize the APC-binding region (ABR) of Asef. This entry represents a single repeat unit of the Armadillo region.


:

Pssm-ID: 465870  Cd Length: 74  Bit Score: 158.09  E-value: 3.39e-45
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1237937751  110 SQPDDKRGRREIRVLHLLEQIRAYCETCWEWQEAHEPGMDQDKNPMPAPVEHQICPAVCVLMKLSFDEEHRHAM 183
Cdd:pfam18797    1 SQPDDKRGRREMRVLHLLEQIRAYCETCWDWQESHSRGPEGDSNPMPSPIEHQICPAICALMKLSFDEEHRHAM 74
APC_u9 pfam16633
Unstructured region on APC between 1st two creatine-rich regions; APC_u9 is a short region of ...
1000-1085 2.43e-35

Unstructured region on APC between 1st two creatine-rich regions; APC_u9 is a short region of natively unstructured sequence lying between the first and second APC_crr, pfam05923, domains on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.


:

Pssm-ID: 435478  Cd Length: 89  Bit Score: 130.38  E-value: 2.43e-35
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 1000 AEDEI-GCNQTTQEADSANTLQIAEIKEKIGTRSAEDPVSEVPAVSQHPRTKSSRLQGSSLS-SESARHKAVEFSSGAKS 1077
Cdd:pfam16633    2 AEDEIeGRDQATRSTDNYNTLQITELKENSGAVSTEQTVSEVPSSSQHIRTKPNRLQASNLSpSDSSRHKAVEFSSGAKS 81

                   ....*...
gi 1237937751 1078 PSKSGAQT 1085
Cdd:pfam16633   82 PSKSGAQT 89
APC_u15 pfam16636
Unstructured region on APC between APC_crr regions 5 and 6; APC_u15 is a short region of ...
1590-1664 4.14e-31

Unstructured region on APC between APC_crr regions 5 and 6; APC_u15 is a short region of natively unstructured sequence lying between the fifth and sixth creatine-rich, APC_crr, pfam05923, domains on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.


:

Pssm-ID: 435480  Cd Length: 81  Bit Score: 118.04  E-value: 4.14e-31
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1237937751 1590 DLSREKAELRKAKENKESEAKVTSHTELTSNQQSANKTQAIAKQPINRGQPKPILQKQSTFPQSSKDIPDRGAAT 1664
Cdd:pfam16636    7 DLSREKAELRKGKETKETETKVTSHIEQPSNQQSTNRTQACQKHPPNRGQPKPLLQKQTTFPQSSKDIPDRGAAT 81
APC_u13 pfam16634
Unstructured region on APC between APC_crr and SAMP; APC_u13 is a short region of natively ...
1379-1432 1.61e-24

Unstructured region on APC between APC_crr and SAMP; APC_u13 is a short region of natively unstructured sequence lying between the fourth creatine-rich region, APC_crr, pfam05923, and the SAMP pfam05924, domains on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.


:

Pssm-ID: 406927  Cd Length: 54  Bit Score: 98.33  E-value: 1.61e-24
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1237937751 1379 IESPPNELAAGEGVRGGAQSGEFEKRDTIPTEGRSTDEAQGGKTSSVTIPELDD 1432
Cdd:pfam16634    1 IESPPNELANAESTGTGAESAEFEKRDTIPTEGRSTDDAQRGKKSNITTSALDD 54
APC_r pfam05923
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ...
1354-1377 1.71e-07

APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.


:

Pssm-ID: 461781  Cd Length: 24  Bit Score: 48.92  E-value: 1.71e-07
                           10        20
                   ....*....|....*....|....
gi 1237937751 1354 DMPRVYCVEGTPINFSTATSLSDL 1377
Cdd:pfam05923    1 DSPKRYCVEGTPANFSRASSLSSL 24
ARM smart00185
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form ...
366-406 4.15e-07

Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.


:

Pssm-ID: 214547 [Multi-domain]  Cd Length: 41  Bit Score: 48.19  E-value: 4.15e-07
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|.
gi 1237937751   366 NEDHRQILRENNCLQTLLQHLKSHSLTIVSNACGTLWNLSA 406
Cdd:smart00185    1 DDENKQAVVDAGGLPALVELLKSEDEEVVKEAAWALSNLSS 41
SAMP pfam05924
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis ...
1750-1769 2.97e-05

SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). This motif binds axin.


:

Pssm-ID: 461782  Cd Length: 22  Bit Score: 42.58  E-value: 2.97e-05
                           10        20
                   ....*....|....*....|
gi 1237937751 1750 DSEDDLLQECISSAMPKKKK 1769
Cdd:pfam05924    3 DDEDDLLQECINSAMPKKRR 22
ARM smart00185
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form ...
229-270 4.73e-04

Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.


:

Pssm-ID: 214547 [Multi-domain]  Cd Length: 41  Bit Score: 39.72  E-value: 4.73e-04
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|..
gi 1237937751   229 DVANKATLCSMkGCMRALVAQLKSESEDLQQVIASVLRNLSW 270
Cdd:smart00185    1 DDENKQAVVDA-GGLPALVELLKSEDEEVVKEAAWALSNLSS 41
Arm pfam00514
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ...
408-448 6.30e-04

Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.


:

Pssm-ID: 425727 [Multi-domain]  Cd Length: 41  Bit Score: 39.36  E-value: 6.30e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 1237937751  408 NPKDQEALWDMGAVSMLKNLIHSKHKMIAMGSAAALRNLMA 448
Cdd:pfam00514    1 SPENKQAVIEAGAVPPLVRLLSSPDEEVQEEAAWALSNLAA 41
SAMP pfam05924
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis ...
1433-1454 1.14e-03

SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). This motif binds axin.


:

Pssm-ID: 461782  Cd Length: 22  Bit Score: 38.34  E-value: 1.14e-03
                           10        20
                   ....*....|....*....|..
gi 1237937751 1433 NKAEEGDILAECINSAMPKGKS 1454
Cdd:pfam05924    1 SPDDEDDLLQECINSAMPKKRR 22
APC_r pfam05923
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ...
1088-1110 3.92e-03

APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.


:

Pssm-ID: 461781  Cd Length: 24  Bit Score: 36.59  E-value: 3.92e-03
                           10        20
                   ....*....|....*....|....
gi 1237937751 1088 SPPEHY-VQETPLMFSRCTSVSSL 1110
Cdd:pfam05923    1 DSPKRYcVEGTPANFSRASSLSSL 24
PTZ00449 super family cl33186
104 kDa microneme/rhoptry antigen; Provisional
1008-1418 6.00e-03

104 kDa microneme/rhoptry antigen; Provisional


The actual alignment was detected with superfamily member PTZ00449:

Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 41.98  E-value: 6.00e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 1008 QTTQEAD---SANTLQIAEIKEKIGTRSAEDPvsEVPAVSQHPRTKSSRLQGSSLSSESARHKAVEFSSGAKSPSKSGAQ 1084
Cdd:PTZ00449   480 QFTQEIKkliKKSKKKLAPIEEEDSDKHDEPP--EGPEASGLPPKAPGDKEGEEGEHEDSKESDEPKEGGKPGETKEGEV 557
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 1085 TPKSPP--EHYVQETPLMFSRCT----SVSSLDSFESRSIASSVQ-SEPCSGMVSGIISPSDLPDSPGQTMPPSRSKTPP 1157
Cdd:PTZ00449   558 GKKPGPakEHKPSKIPTLSKKPEfpkdPKHPKDPEEPKKPKRPRSaQRPTRPKSPKLPELLDIPKSPKRPESPKSPKRPP 637
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 1158 PPPQTAQTKR-EVPK----NKAPTAEKRESGPK------QAAVNAAVQRVQvlpdadtllhfaTESTPDGFSCSSSLSAL 1226
Cdd:PTZ00449   638 PPQRPSSPERpEGPKiiksPKPPKSPKPPFDPKfkekfyDDYLDAAAKSKE------------TKTTVVLDESFESILKE 705
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 1227 SLDEPFIQKDVELRIMPPVQENDNG------NETESEQPKESNENQEKEAEKTIDSEKDLLDDSDDDDIEILEECIISAM 1300
Cdd:PTZ00449   706 TLPETPGTPFTTPRPLPPKLPRDEEfpfepiGDPDAEQPDDIEFFTPPEEERTFFHETPADTPLPDILAEEFKEEDIHAE 785
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 1301 PTKSSRKAKKPAQTASKLPPPVARKPSqlpvyklLPSQNRLQPQKHVSFTPGDDMPRVYCVE--GTPINFSTATSLSDL- 1377
Cdd:PTZ00449   786 TGEPDEAMKRPDSPSEHEDKPPGDHPS-------LPKKRHRLDGLALSTTDLESDAGRIAKDasGKIVKLKRSKSFDDLt 858
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|.
gi 1237937751 1378 TIESPPNELAAGEGVRGGAQSGEFEKRDTIPTEGRSTDEAQ 1418
Cdd:PTZ00449   859 TVEEAEEMGAEARKIVVDDDGTEADDEDTHPPEEKHKSEVR 899
ARM smart00185
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form ...
57-109 7.30e-03

Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.


:

Pssm-ID: 214547 [Multi-domain]  Cd Length: 41  Bit Score: 36.25  E-value: 7.30e-03
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|...
gi 1237937751    57 SQDSCISMRQSGCLPLLIQLLHgndkdsvllgnsRGSKEARARASAALHNIIH 109
Cdd:smart00185    1 DDENKQAVVDAGGLPALVELLK------------SEDEEVVKEAAWALSNLSS 41
APC_r pfam05923
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ...
974-991 7.34e-03

APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.


:

Pssm-ID: 461781  Cd Length: 24  Bit Score: 35.82  E-value: 7.34e-03
                           10
                   ....*....|....*...
gi 1237937751  974 ETIQTYCVEDTPICFSRC 991
Cdd:pfam05923    1 DSPKRYCVEGTPANFSRA 18
 
Name Accession Description Interval E-value
Arm_APC_u3 pfam16629
Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately ...
449-736 9.76e-166

Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately downstream of the armadillo fold before the beta-catenin binding motifs, APC_crr, pfam05923, on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.


Pssm-ID: 435476  Cd Length: 293  Bit Score: 512.21  E-value: 9.76e-166
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751  449 NRPAKYKDANIMSPGSSLPSLHVRKQKALEAELDAQHLSETFDNIDNLSPKASHRSKQRHKQSLYGDYVFDTNRHDDN-- 526
Cdd:pfam16629    1 NRPAKYKDANIMSPGSSLPSLHVRKQKALEAELDAQHLSETFDNIDNLSPKASHRNKQRHKQNVYSEYVLDSGRHDDSvc 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751  527 RSDNFNTGNMTVLSPYLNTTVLPSSSSSRG--SLDSSRSEKDRSLERERGIGLGNYHPATENPGTSSKR-GLQISTTAAQ 603
Cdd:pfam16629   81 RSDNFNTGNVTVLSPYLNTTVLPSSSSRDSrgNAESSRSEKDRSLDRERGAGLSNFHPATENSGNSSKRiGMQISTTAAQ 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751  604 IAKVMEEVSAIHTSQEDRSSGSTTELHCVTDERNALRRSSAAHTHSNTYNFTKSENSNRTCSMPYAKLEYKRSSNDSLNS 683
Cdd:pfam16629  161 IAKVMEEVSSMHISQEDRSSGSTSDMHCMQDDRNSIRRSSTAHPHSNVYSFNKSESSNRPCPMPYMKMEYKRASNDSLNS 240
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1237937751  684 VSSSDGYGKRGQMKPSIESYSEDDESKFCSYGQYPADLAHKIHSANHMDDNDG 736
Cdd:pfam16629  241 VSSSDGYGKRGQMKPSVESYSEDDEGKFCSYGKYPADLAHKIHSANHMDDNDG 293
APC_basic pfam05956
APC basic domain; This region of the APC family of proteins is known as the basic domain. It ...
1940-2285 5.72e-108

APC basic domain; This region of the APC family of proteins is known as the basic domain. It contains a high proportion of positively charged amino acids and interacts with microtubules.


Pssm-ID: 428690 [Multi-domain]  Cd Length: 336  Bit Score: 348.79  E-value: 5.72e-108
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 1940 SISRGRTMIHIPGVRNSSSSTSPVSKKGPPLKTPASKSPSEGQTATTS-PRGAKPSVKSELSPVARQTSQIGGSSKAPSR 2018
Cdd:pfam05956    1 VVFRGRTVIYMPGVKESQPSTSPPPKKTPPKTDAPAKNPNLGQQRSRSlHRLGKPSELADLSPPKRSATPPARISKAPSS 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 2019 SGSRDSTPSRPAQQPLSRPIQSPGRNSISPGRNgisppnKLSQLPRTSSPSTASTKSSGSgKMSYTSPGRQMSQQNLTKQ 2098
Cdd:pfam05956   81 GSSRDSTPSRPPQKKLTSPSQSPGRLPGSGGRN------KLSPLPKTKSPARASTKKSGS-HKTQKSPVRIPFMQTPTKQ 153
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 2099 TGLSKNASSI------PRSESASKGLNqmnngNGANKKVELSRMSSTKSSGSESDRSerpVLVRQSTFIKEAPSPTLRRK 2172
Cdd:pfam05956  154 TGLPRNPSPLvtnqpePRSESASKGLR-----SLPGKRLDLVRMSSARSSGSESDRS---GFLRQLTFIKESPSLLLRRR 225
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 2173 LEESASfESLSPSSRPASPTRSQaqtpvlsPSLPDMSLSTHSSVQAGGWRKLPPNLSPTIEYNDGRPAKRHDIARSHSES 2252
Cdd:pfam05956  226 LELSAS-ESLSPSSQPASPRRSR-------PGLPAVFLCSSRCQELKGWRKQPPNPNSRAEPSDRPLTRRRPPRRTSSES 297
                          330       340       350
                   ....*....|....*....|....*....|...
gi 1237937751 2253 PSRLPInRSGTWKREHSKHSSSLPRVSTWRRTG 2285
Cdd:pfam05956  298 PSRLPV-RNGTWKRETFKRYSSLPHINVWRRTG 329
EB1_binding pfam05937
EB-1 Binding Domain; This region, found at the C-terminus of the APC proteins, binds the ...
2387-2560 2.07e-88

EB-1 Binding Domain; This region, found at the C-terminus of the APC proteins, binds the microtubule-associating protein EB-1. At the C-terminus of the alignment is also a pfam00595 binding domain. A short motif in the middle of the region appears to be found in the APC2 proteins.


Pssm-ID: 399141  Cd Length: 174  Bit Score: 285.74  E-value: 2.07e-88
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 2387 RSGRSPTGNTPPVIDSVSEKANPNIKDSKDNQAKQNVGNGSVPMRTVGLENRLNSFIQVDAPDQKGTEIKPGQNNPVPVS 2466
Cdd:pfam05937    1 RSGRSPTGNTPPVIDSVPEKGIKDEKDSKDPQAKQNMGNGNVPVRTVGLENRLNSFIQSDSPDKKGTETKPLQNNPVPTP 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 2467 ETNESSIVERTPFSSSSSSKHSSPSGTVAARVTPFNYNPSPRKSSADSTSARPSQIPTPVNNNTKKRDSKTDSTESSGTQ 2546
Cdd:pfam05937   81 ETNENPVSERTPFSSSSSSKHSSPSGAVAARVTPFNYNPSPRKSSADSSSARPSQIPTPVNNSTKKRDSKTESTDSSGNQ 160
                          170
                   ....*....|....
gi 1237937751 2547 SPKRHSGSYLVTSV 2560
Cdd:pfam05937  161 SPKRHSGSYLVTSV 174
APC_u5 pfam16630
Unstructured region on APC between 1st and 2nd catenin-bdg motifs; APC_u5 is a short region of ...
753-852 1.17e-56

Unstructured region on APC between 1st and 2nd catenin-bdg motifs; APC_u5 is a short region of natively unstructured sequence lying between the first and the second 15-residue beta-catenin binding motifs, APC_15aa, pfam05972, on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.


Pssm-ID: 406923  Cd Length: 100  Bit Score: 191.66  E-value: 1.17e-56
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751  753 LNSGRQSPSQNERWARPKHIIEDEIKQSEQRQSRNQSTTYPVYTESTDDKHLKFQPHFGQQECVSPYRSRGANGSETNRV 832
Cdd:pfam16630    1 LNSGRQSPSQNERWARPKHIIEDEMKQSEQRQPRSQSTTYPVYTESGDDKHMKFQPRFGQQECVSPFRSRGSNGSEQSRV 80
                           90       100
                   ....*....|....*....|
gi 1237937751  833 GSNHGINQNVSQSLCQEDDY 852
Cdd:pfam16630   81 GSSHGINQKVSQSLCQVDDY 100
APC_u14 pfam16635
Unstructured region on APC between SAMP and APC_crr; APC_u14 is a short region of natively ...
1463-1556 1.48e-46

Unstructured region on APC between SAMP and APC_crr; APC_u14 is a short region of natively unstructured sequence lying between the second SAMP pfam05924, and the fifth creatine-rich region, APC_crr, pfam05923, on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.


Pssm-ID: 435479  Cd Length: 94  Bit Score: 162.70  E-value: 1.48e-46
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 1463 IMDQVQQASASSSAPNKNQLDGKKKKPTSPVKPIPQNTEYRTRVRKNADSKNNLNAERVFSDNKDSKKQNLKNNSKVFND 1542
Cdd:pfam16635    1 IMDQIQQASAASSGGSKSQQDGEKKKPTSPVKPMPQSSEYRARVRKNTESKNNLNSERSYPDNKESKKQNLKNNSRDFND 80
                           90
                   ....*....|....
gi 1237937751 1543 KLPNNEDRVRGSFA 1556
Cdd:pfam16635   81 KLPNNEERTRGSFA 94
APC_rep pfam18797
Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo ...
110-183 3.39e-45

Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo repeat and uses its highly conserved surface groove to recognize the APC-binding region (ABR) of Asef. This entry represents a single repeat unit of the Armadillo region.


Pssm-ID: 465870  Cd Length: 74  Bit Score: 158.09  E-value: 3.39e-45
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1237937751  110 SQPDDKRGRREIRVLHLLEQIRAYCETCWEWQEAHEPGMDQDKNPMPAPVEHQICPAVCVLMKLSFDEEHRHAM 183
Cdd:pfam18797    1 SQPDDKRGRREMRVLHLLEQIRAYCETCWDWQESHSRGPEGDSNPMPSPIEHQICPAICALMKLSFDEEHRHAM 74
APC_u9 pfam16633
Unstructured region on APC between 1st two creatine-rich regions; APC_u9 is a short region of ...
1000-1085 2.43e-35

Unstructured region on APC between 1st two creatine-rich regions; APC_u9 is a short region of natively unstructured sequence lying between the first and second APC_crr, pfam05923, domains on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.


Pssm-ID: 435478  Cd Length: 89  Bit Score: 130.38  E-value: 2.43e-35
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 1000 AEDEI-GCNQTTQEADSANTLQIAEIKEKIGTRSAEDPVSEVPAVSQHPRTKSSRLQGSSLS-SESARHKAVEFSSGAKS 1077
Cdd:pfam16633    2 AEDEIeGRDQATRSTDNYNTLQITELKENSGAVSTEQTVSEVPSSSQHIRTKPNRLQASNLSpSDSSRHKAVEFSSGAKS 81

                   ....*...
gi 1237937751 1078 PSKSGAQT 1085
Cdd:pfam16633   82 PSKSGAQT 89
APC_u15 pfam16636
Unstructured region on APC between APC_crr regions 5 and 6; APC_u15 is a short region of ...
1590-1664 4.14e-31

Unstructured region on APC between APC_crr regions 5 and 6; APC_u15 is a short region of natively unstructured sequence lying between the fifth and sixth creatine-rich, APC_crr, pfam05923, domains on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.


Pssm-ID: 435480  Cd Length: 81  Bit Score: 118.04  E-value: 4.14e-31
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1237937751 1590 DLSREKAELRKAKENKESEAKVTSHTELTSNQQSANKTQAIAKQPINRGQPKPILQKQSTFPQSSKDIPDRGAAT 1664
Cdd:pfam16636    7 DLSREKAELRKGKETKETETKVTSHIEQPSNQQSTNRTQACQKHPPNRGQPKPLLQKQTTFPQSSKDIPDRGAAT 81
APC_u13 pfam16634
Unstructured region on APC between APC_crr and SAMP; APC_u13 is a short region of natively ...
1379-1432 1.61e-24

Unstructured region on APC between APC_crr and SAMP; APC_u13 is a short region of natively unstructured sequence lying between the fourth creatine-rich region, APC_crr, pfam05923, and the SAMP pfam05924, domains on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.


Pssm-ID: 406927  Cd Length: 54  Bit Score: 98.33  E-value: 1.61e-24
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1237937751 1379 IESPPNELAAGEGVRGGAQSGEFEKRDTIPTEGRSTDEAQGGKTSSVTIPELDD 1432
Cdd:pfam16634    1 IESPPNELANAESTGTGAESAEFEKRDTIPTEGRSTDDAQRGKKSNITTSALDD 54
APC_r pfam05923
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ...
1354-1377 1.71e-07

APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.


Pssm-ID: 461781  Cd Length: 24  Bit Score: 48.92  E-value: 1.71e-07
                           10        20
                   ....*....|....*....|....
gi 1237937751 1354 DMPRVYCVEGTPINFSTATSLSDL 1377
Cdd:pfam05923    1 DSPKRYCVEGTPANFSRASSLSSL 24
ARM smart00185
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form ...
366-406 4.15e-07

Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.


Pssm-ID: 214547 [Multi-domain]  Cd Length: 41  Bit Score: 48.19  E-value: 4.15e-07
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|.
gi 1237937751   366 NEDHRQILRENNCLQTLLQHLKSHSLTIVSNACGTLWNLSA 406
Cdd:smart00185    1 DDENKQAVVDAGGLPALVELLKSEDEEVVKEAAWALSNLSS 41
Arm pfam00514
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ...
366-406 5.64e-07

Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.


Pssm-ID: 425727 [Multi-domain]  Cd Length: 41  Bit Score: 47.83  E-value: 5.64e-07
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 1237937751  366 NEDHRQILRENNCLQTLLQHLKSHSLTIVSNACGTLWNLSA 406
Cdd:pfam00514    1 SPENKQAVIEAGAVPPLVRLLSSPDEEVQEEAAWALSNLAA 41
PHA03247 PHA03247
large tegument protein UL36; Provisional
1968-2281 1.38e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 54.17  E-value: 1.38e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 1968 PPLKTPASKsPSEGQTATTSPRGAKPSVKSE----LSPVARQTSQIGGSSKAPSRSGSRDSTPSRPAQQPLSRPIQSPGR 2043
Cdd:PHA03247  2703 PPPPTPEPA-PHALVSATPLPPGPAAARQASpalpAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPR 2781
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 2044 NSISPGRNGISPPNKLSQLPRTSSPSTASTKSSGSGKMSYTSPGrqmsqqnltkqTGLSKNASSIPRSESASKGLNQ--M 2121
Cdd:PHA03247  2782 RLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPA-----------GPLPPPTSAQPTAPPPPPGPPPpsL 2850
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 2122 NNGNGANKKVELSRMSSTKSSGSESDRSERPVLVRqstfikeAPSPTLRRKLEESA-SFESLSPSSRPASPTRSQAQTPV 2200
Cdd:PHA03247  2851 PLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRR-------LARPAVSRSTESFAlPPDQPERPPQPQAPPPPQPQPQP 2923
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 2201 LSPSLPDMSLSTHSSVQAggwrKLPPNLSPTieyNDGRPAKRHDIARSHSESPSRLPINRSGTWK----RE------HSK 2270
Cdd:PHA03247  2924 PPPPQPQPPPPPPPRPQP----PLAPTTDPA---GAGEPSGAVPQPWLGALVPGRVAVPRFRVPQpapsREapasstPPL 2996
                          330
                   ....*....|.
gi 1237937751 2271 HSSSLPRVSTW 2281
Cdd:PHA03247  2997 TGHSLSRVSSW 3007
SAMP pfam05924
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis ...
1750-1769 2.97e-05

SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). This motif binds axin.


Pssm-ID: 461782  Cd Length: 22  Bit Score: 42.58  E-value: 2.97e-05
                           10        20
                   ....*....|....*....|
gi 1237937751 1750 DSEDDLLQECISSAMPKKKK 1769
Cdd:pfam05924    3 DDEDDLLQECINSAMPKKRR 22
ARM smart00185
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form ...
229-270 4.73e-04

Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.


Pssm-ID: 214547 [Multi-domain]  Cd Length: 41  Bit Score: 39.72  E-value: 4.73e-04
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|..
gi 1237937751   229 DVANKATLCSMkGCMRALVAQLKSESEDLQQVIASVLRNLSW 270
Cdd:smart00185    1 DDENKQAVVDA-GGLPALVELLKSEDEEVVKEAAWALSNLSS 41
Arm pfam00514
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ...
229-269 5.44e-04

Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.


Pssm-ID: 425727 [Multi-domain]  Cd Length: 41  Bit Score: 39.36  E-value: 5.44e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 1237937751  229 DVANKATLCSMkGCMRALVAQLKSESEDLQQVIASVLRNLS 269
Cdd:pfam00514    1 SPENKQAVIEA-GAVPPLVRLLSSPDEEVQEEAAWALSNLA 40
Arm pfam00514
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ...
408-448 6.30e-04

Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.


Pssm-ID: 425727 [Multi-domain]  Cd Length: 41  Bit Score: 39.36  E-value: 6.30e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 1237937751  408 NPKDQEALWDMGAVSMLKNLIHSKHKMIAMGSAAALRNLMA 448
Cdd:pfam00514    1 SPENKQAVIEAGAVPPLVRLLSSPDEEVQEEAAWALSNLAA 41
SAMP pfam05924
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis ...
1433-1454 1.14e-03

SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). This motif binds axin.


Pssm-ID: 461782  Cd Length: 22  Bit Score: 38.34  E-value: 1.14e-03
                           10        20
                   ....*....|....*....|..
gi 1237937751 1433 NKAEEGDILAECINSAMPKGKS 1454
Cdd:pfam05924    1 SPDDEDDLLQECINSAMPKKRR 22
APC_r pfam05923
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ...
1088-1110 3.92e-03

APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.


Pssm-ID: 461781  Cd Length: 24  Bit Score: 36.59  E-value: 3.92e-03
                           10        20
                   ....*....|....*....|....
gi 1237937751 1088 SPPEHY-VQETPLMFSRCTSVSSL 1110
Cdd:pfam05923    1 DSPKRYcVEGTPANFSRASSLSSL 24
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
1008-1418 6.00e-03

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 41.98  E-value: 6.00e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 1008 QTTQEAD---SANTLQIAEIKEKIGTRSAEDPvsEVPAVSQHPRTKSSRLQGSSLSSESARHKAVEFSSGAKSPSKSGAQ 1084
Cdd:PTZ00449   480 QFTQEIKkliKKSKKKLAPIEEEDSDKHDEPP--EGPEASGLPPKAPGDKEGEEGEHEDSKESDEPKEGGKPGETKEGEV 557
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 1085 TPKSPP--EHYVQETPLMFSRCT----SVSSLDSFESRSIASSVQ-SEPCSGMVSGIISPSDLPDSPGQTMPPSRSKTPP 1157
Cdd:PTZ00449   558 GKKPGPakEHKPSKIPTLSKKPEfpkdPKHPKDPEEPKKPKRPRSaQRPTRPKSPKLPELLDIPKSPKRPESPKSPKRPP 637
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 1158 PPPQTAQTKR-EVPK----NKAPTAEKRESGPK------QAAVNAAVQRVQvlpdadtllhfaTESTPDGFSCSSSLSAL 1226
Cdd:PTZ00449   638 PPQRPSSPERpEGPKiiksPKPPKSPKPPFDPKfkekfyDDYLDAAAKSKE------------TKTTVVLDESFESILKE 705
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 1227 SLDEPFIQKDVELRIMPPVQENDNG------NETESEQPKESNENQEKEAEKTIDSEKDLLDDSDDDDIEILEECIISAM 1300
Cdd:PTZ00449   706 TLPETPGTPFTTPRPLPPKLPRDEEfpfepiGDPDAEQPDDIEFFTPPEEERTFFHETPADTPLPDILAEEFKEEDIHAE 785
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 1301 PTKSSRKAKKPAQTASKLPPPVARKPSqlpvyklLPSQNRLQPQKHVSFTPGDDMPRVYCVE--GTPINFSTATSLSDL- 1377
Cdd:PTZ00449   786 TGEPDEAMKRPDSPSEHEDKPPGDHPS-------LPKKRHRLDGLALSTTDLESDAGRIAKDasGKIVKLKRSKSFDDLt 858
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|.
gi 1237937751 1378 TIESPPNELAAGEGVRGGAQSGEFEKRDTIPTEGRSTDEAQ 1418
Cdd:PTZ00449   859 TVEEAEEMGAEARKIVVDDDGTEADDEDTHPPEEKHKSEVR 899
ARM smart00185
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form ...
57-109 7.30e-03

Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.


Pssm-ID: 214547 [Multi-domain]  Cd Length: 41  Bit Score: 36.25  E-value: 7.30e-03
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|...
gi 1237937751    57 SQDSCISMRQSGCLPLLIQLLHgndkdsvllgnsRGSKEARARASAALHNIIH 109
Cdd:smart00185    1 DDENKQAVVDAGGLPALVELLK------------SEDEEVVKEAAWALSNLSS 41
APC_r pfam05923
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ...
974-991 7.34e-03

APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.


Pssm-ID: 461781  Cd Length: 24  Bit Score: 35.82  E-value: 7.34e-03
                           10
                   ....*....|....*...
gi 1237937751  974 ETIQTYCVEDTPICFSRC 991
Cdd:pfam05923    1 DSPKRYCVEGTPANFSRA 18
 
Name Accession Description Interval E-value
Arm_APC_u3 pfam16629
Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately ...
449-736 9.76e-166

Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately downstream of the armadillo fold before the beta-catenin binding motifs, APC_crr, pfam05923, on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.


Pssm-ID: 435476  Cd Length: 293  Bit Score: 512.21  E-value: 9.76e-166
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751  449 NRPAKYKDANIMSPGSSLPSLHVRKQKALEAELDAQHLSETFDNIDNLSPKASHRSKQRHKQSLYGDYVFDTNRHDDN-- 526
Cdd:pfam16629    1 NRPAKYKDANIMSPGSSLPSLHVRKQKALEAELDAQHLSETFDNIDNLSPKASHRNKQRHKQNVYSEYVLDSGRHDDSvc 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751  527 RSDNFNTGNMTVLSPYLNTTVLPSSSSSRG--SLDSSRSEKDRSLERERGIGLGNYHPATENPGTSSKR-GLQISTTAAQ 603
Cdd:pfam16629   81 RSDNFNTGNVTVLSPYLNTTVLPSSSSRDSrgNAESSRSEKDRSLDRERGAGLSNFHPATENSGNSSKRiGMQISTTAAQ 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751  604 IAKVMEEVSAIHTSQEDRSSGSTTELHCVTDERNALRRSSAAHTHSNTYNFTKSENSNRTCSMPYAKLEYKRSSNDSLNS 683
Cdd:pfam16629  161 IAKVMEEVSSMHISQEDRSSGSTSDMHCMQDDRNSIRRSSTAHPHSNVYSFNKSESSNRPCPMPYMKMEYKRASNDSLNS 240
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1237937751  684 VSSSDGYGKRGQMKPSIESYSEDDESKFCSYGQYPADLAHKIHSANHMDDNDG 736
Cdd:pfam16629  241 VSSSDGYGKRGQMKPSVESYSEDDEGKFCSYGKYPADLAHKIHSANHMDDNDG 293
APC_basic pfam05956
APC basic domain; This region of the APC family of proteins is known as the basic domain. It ...
1940-2285 5.72e-108

APC basic domain; This region of the APC family of proteins is known as the basic domain. It contains a high proportion of positively charged amino acids and interacts with microtubules.


Pssm-ID: 428690 [Multi-domain]  Cd Length: 336  Bit Score: 348.79  E-value: 5.72e-108
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 1940 SISRGRTMIHIPGVRNSSSSTSPVSKKGPPLKTPASKSPSEGQTATTS-PRGAKPSVKSELSPVARQTSQIGGSSKAPSR 2018
Cdd:pfam05956    1 VVFRGRTVIYMPGVKESQPSTSPPPKKTPPKTDAPAKNPNLGQQRSRSlHRLGKPSELADLSPPKRSATPPARISKAPSS 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 2019 SGSRDSTPSRPAQQPLSRPIQSPGRNSISPGRNgisppnKLSQLPRTSSPSTASTKSSGSgKMSYTSPGRQMSQQNLTKQ 2098
Cdd:pfam05956   81 GSSRDSTPSRPPQKKLTSPSQSPGRLPGSGGRN------KLSPLPKTKSPARASTKKSGS-HKTQKSPVRIPFMQTPTKQ 153
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 2099 TGLSKNASSI------PRSESASKGLNqmnngNGANKKVELSRMSSTKSSGSESDRSerpVLVRQSTFIKEAPSPTLRRK 2172
Cdd:pfam05956  154 TGLPRNPSPLvtnqpePRSESASKGLR-----SLPGKRLDLVRMSSARSSGSESDRS---GFLRQLTFIKESPSLLLRRR 225
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 2173 LEESASfESLSPSSRPASPTRSQaqtpvlsPSLPDMSLSTHSSVQAGGWRKLPPNLSPTIEYNDGRPAKRHDIARSHSES 2252
Cdd:pfam05956  226 LELSAS-ESLSPSSQPASPRRSR-------PGLPAVFLCSSRCQELKGWRKQPPNPNSRAEPSDRPLTRRRPPRRTSSES 297
                          330       340       350
                   ....*....|....*....|....*....|...
gi 1237937751 2253 PSRLPInRSGTWKREHSKHSSSLPRVSTWRRTG 2285
Cdd:pfam05956  298 PSRLPV-RNGTWKRETFKRYSSLPHINVWRRTG 329
EB1_binding pfam05937
EB-1 Binding Domain; This region, found at the C-terminus of the APC proteins, binds the ...
2387-2560 2.07e-88

EB-1 Binding Domain; This region, found at the C-terminus of the APC proteins, binds the microtubule-associating protein EB-1. At the C-terminus of the alignment is also a pfam00595 binding domain. A short motif in the middle of the region appears to be found in the APC2 proteins.


Pssm-ID: 399141  Cd Length: 174  Bit Score: 285.74  E-value: 2.07e-88
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 2387 RSGRSPTGNTPPVIDSVSEKANPNIKDSKDNQAKQNVGNGSVPMRTVGLENRLNSFIQVDAPDQKGTEIKPGQNNPVPVS 2466
Cdd:pfam05937    1 RSGRSPTGNTPPVIDSVPEKGIKDEKDSKDPQAKQNMGNGNVPVRTVGLENRLNSFIQSDSPDKKGTETKPLQNNPVPTP 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 2467 ETNESSIVERTPFSSSSSSKHSSPSGTVAARVTPFNYNPSPRKSSADSTSARPSQIPTPVNNNTKKRDSKTDSTESSGTQ 2546
Cdd:pfam05937   81 ETNENPVSERTPFSSSSSSKHSSPSGAVAARVTPFNYNPSPRKSSADSSSARPSQIPTPVNNSTKKRDSKTESTDSSGNQ 160
                          170
                   ....*....|....
gi 1237937751 2547 SPKRHSGSYLVTSV 2560
Cdd:pfam05937  161 SPKRHSGSYLVTSV 174
APC_u5 pfam16630
Unstructured region on APC between 1st and 2nd catenin-bdg motifs; APC_u5 is a short region of ...
753-852 1.17e-56

Unstructured region on APC between 1st and 2nd catenin-bdg motifs; APC_u5 is a short region of natively unstructured sequence lying between the first and the second 15-residue beta-catenin binding motifs, APC_15aa, pfam05972, on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.


Pssm-ID: 406923  Cd Length: 100  Bit Score: 191.66  E-value: 1.17e-56
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751  753 LNSGRQSPSQNERWARPKHIIEDEIKQSEQRQSRNQSTTYPVYTESTDDKHLKFQPHFGQQECVSPYRSRGANGSETNRV 832
Cdd:pfam16630    1 LNSGRQSPSQNERWARPKHIIEDEMKQSEQRQPRSQSTTYPVYTESGDDKHMKFQPRFGQQECVSPFRSRGSNGSEQSRV 80
                           90       100
                   ....*....|....*....|
gi 1237937751  833 GSNHGINQNVSQSLCQEDDY 852
Cdd:pfam16630   81 GSSHGINQKVSQSLCQVDDY 100
APC_u14 pfam16635
Unstructured region on APC between SAMP and APC_crr; APC_u14 is a short region of natively ...
1463-1556 1.48e-46

Unstructured region on APC between SAMP and APC_crr; APC_u14 is a short region of natively unstructured sequence lying between the second SAMP pfam05924, and the fifth creatine-rich region, APC_crr, pfam05923, on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.


Pssm-ID: 435479  Cd Length: 94  Bit Score: 162.70  E-value: 1.48e-46
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 1463 IMDQVQQASASSSAPNKNQLDGKKKKPTSPVKPIPQNTEYRTRVRKNADSKNNLNAERVFSDNKDSKKQNLKNNSKVFND 1542
Cdd:pfam16635    1 IMDQIQQASAASSGGSKSQQDGEKKKPTSPVKPMPQSSEYRARVRKNTESKNNLNSERSYPDNKESKKQNLKNNSRDFND 80
                           90
                   ....*....|....
gi 1237937751 1543 KLPNNEDRVRGSFA 1556
Cdd:pfam16635   81 KLPNNEERTRGSFA 94
APC_rep pfam18797
Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo ...
110-183 3.39e-45

Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo repeat and uses its highly conserved surface groove to recognize the APC-binding region (ABR) of Asef. This entry represents a single repeat unit of the Armadillo region.


Pssm-ID: 465870  Cd Length: 74  Bit Score: 158.09  E-value: 3.39e-45
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1237937751  110 SQPDDKRGRREIRVLHLLEQIRAYCETCWEWQEAHEPGMDQDKNPMPAPVEHQICPAVCVLMKLSFDEEHRHAM 183
Cdd:pfam18797    1 SQPDDKRGRREMRVLHLLEQIRAYCETCWDWQESHSRGPEGDSNPMPSPIEHQICPAICALMKLSFDEEHRHAM 74
APC_u9 pfam16633
Unstructured region on APC between 1st two creatine-rich regions; APC_u9 is a short region of ...
1000-1085 2.43e-35

Unstructured region on APC between 1st two creatine-rich regions; APC_u9 is a short region of natively unstructured sequence lying between the first and second APC_crr, pfam05923, domains on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.


Pssm-ID: 435478  Cd Length: 89  Bit Score: 130.38  E-value: 2.43e-35
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 1000 AEDEI-GCNQTTQEADSANTLQIAEIKEKIGTRSAEDPVSEVPAVSQHPRTKSSRLQGSSLS-SESARHKAVEFSSGAKS 1077
Cdd:pfam16633    2 AEDEIeGRDQATRSTDNYNTLQITELKENSGAVSTEQTVSEVPSSSQHIRTKPNRLQASNLSpSDSSRHKAVEFSSGAKS 81

                   ....*...
gi 1237937751 1078 PSKSGAQT 1085
Cdd:pfam16633   82 PSKSGAQT 89
APC_u15 pfam16636
Unstructured region on APC between APC_crr regions 5 and 6; APC_u15 is a short region of ...
1590-1664 4.14e-31

Unstructured region on APC between APC_crr regions 5 and 6; APC_u15 is a short region of natively unstructured sequence lying between the fifth and sixth creatine-rich, APC_crr, pfam05923, domains on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.


Pssm-ID: 435480  Cd Length: 81  Bit Score: 118.04  E-value: 4.14e-31
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1237937751 1590 DLSREKAELRKAKENKESEAKVTSHTELTSNQQSANKTQAIAKQPINRGQPKPILQKQSTFPQSSKDIPDRGAAT 1664
Cdd:pfam16636    7 DLSREKAELRKGKETKETETKVTSHIEQPSNQQSTNRTQACQKHPPNRGQPKPLLQKQTTFPQSSKDIPDRGAAT 81
APC_u13 pfam16634
Unstructured region on APC between APC_crr and SAMP; APC_u13 is a short region of natively ...
1379-1432 1.61e-24

Unstructured region on APC between APC_crr and SAMP; APC_u13 is a short region of natively unstructured sequence lying between the fourth creatine-rich region, APC_crr, pfam05923, and the SAMP pfam05924, domains on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.


Pssm-ID: 406927  Cd Length: 54  Bit Score: 98.33  E-value: 1.61e-24
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1237937751 1379 IESPPNELAAGEGVRGGAQSGEFEKRDTIPTEGRSTDEAQGGKTSSVTIPELDD 1432
Cdd:pfam16634    1 IESPPNELANAESTGTGAESAEFEKRDTIPTEGRSTDDAQRGKKSNITTSALDD 54
APC_r pfam05923
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ...
1354-1377 1.71e-07

APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.


Pssm-ID: 461781  Cd Length: 24  Bit Score: 48.92  E-value: 1.71e-07
                           10        20
                   ....*....|....*....|....
gi 1237937751 1354 DMPRVYCVEGTPINFSTATSLSDL 1377
Cdd:pfam05923    1 DSPKRYCVEGTPANFSRASSLSSL 24
ARM smart00185
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form ...
366-406 4.15e-07

Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.


Pssm-ID: 214547 [Multi-domain]  Cd Length: 41  Bit Score: 48.19  E-value: 4.15e-07
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|.
gi 1237937751   366 NEDHRQILRENNCLQTLLQHLKSHSLTIVSNACGTLWNLSA 406
Cdd:smart00185    1 DDENKQAVVDAGGLPALVELLKSEDEEVVKEAAWALSNLSS 41
Arm pfam00514
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ...
366-406 5.64e-07

Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.


Pssm-ID: 425727 [Multi-domain]  Cd Length: 41  Bit Score: 47.83  E-value: 5.64e-07
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 1237937751  366 NEDHRQILRENNCLQTLLQHLKSHSLTIVSNACGTLWNLSA 406
Cdd:pfam00514    1 SPENKQAVIEAGAVPPLVRLLSSPDEEVQEEAAWALSNLAA 41
PHA03247 PHA03247
large tegument protein UL36; Provisional
1968-2281 1.38e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 54.17  E-value: 1.38e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 1968 PPLKTPASKsPSEGQTATTSPRGAKPSVKSE----LSPVARQTSQIGGSSKAPSRSGSRDSTPSRPAQQPLSRPIQSPGR 2043
Cdd:PHA03247  2703 PPPPTPEPA-PHALVSATPLPPGPAAARQASpalpAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPR 2781
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 2044 NSISPGRNGISPPNKLSQLPRTSSPSTASTKSSGSGKMSYTSPGrqmsqqnltkqTGLSKNASSIPRSESASKGLNQ--M 2121
Cdd:PHA03247  2782 RLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPA-----------GPLPPPTSAQPTAPPPPPGPPPpsL 2850
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 2122 NNGNGANKKVELSRMSSTKSSGSESDRSERPVLVRqstfikeAPSPTLRRKLEESA-SFESLSPSSRPASPTRSQAQTPV 2200
Cdd:PHA03247  2851 PLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRR-------LARPAVSRSTESFAlPPDQPERPPQPQAPPPPQPQPQP 2923
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 2201 LSPSLPDMSLSTHSSVQAggwrKLPPNLSPTieyNDGRPAKRHDIARSHSESPSRLPINRSGTWK----RE------HSK 2270
Cdd:PHA03247  2924 PPPPQPQPPPPPPPRPQP----PLAPTTDPA---GAGEPSGAVPQPWLGALVPGRVAVPRFRVPQpapsREapasstPPL 2996
                          330
                   ....*....|.
gi 1237937751 2271 HSSSLPRVSTW 2281
Cdd:PHA03247  2997 TGHSLSRVSSW 3007
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
1973-2118 8.36e-06

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 51.71  E-value: 8.36e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 1973 PASKSPSEGQTATTSPRGAKPSVKSELSPVA----RQTSQIGGSSKAPSRSGSRDSTPSRPAQQPLSRPIQSPGRNSISP 2048
Cdd:PHA03307   278 PSSRPGPASSSSSPRERSPSPSPSSPGSGPApsspRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPP 357
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 2049 GRNGISPPNKLSQlPRTSSPSTASTKSSGSGKMSYTSPGRQMSQQNLTKQTGLSKNASSIPRSESASKGL 2118
Cdd:PHA03307   358 PPADPSSPRKRPR-PSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFPAGRPRPSPLDAGAASGAF 426
SAMP pfam05924
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis ...
1750-1769 2.97e-05

SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). This motif binds axin.


Pssm-ID: 461782  Cd Length: 22  Bit Score: 42.58  E-value: 2.97e-05
                           10        20
                   ....*....|....*....|
gi 1237937751 1750 DSEDDLLQECISSAMPKKKK 1769
Cdd:pfam05924    3 DDEDDLLQECINSAMPKKRR 22
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
1966-2261 5.94e-05

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 48.63  E-value: 5.94e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 1966 KGPPLKTPASKSPSEGQTATTSPRGAKPSVKSELSPVARQTSQIGGSSKAPSRSGSRDSTPSRPAQQP---LSRPIQSPG 2042
Cdd:PHA03307   101 AREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASsrqAALPLSSPE 180
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 2043 RNSISPGRNGISPPNKLSQLPRTSSP----STASTKSSGSGKMSYTSPGRQMSQQNLTKQTGLSKNASSIPRSE----SA 2114
Cdd:PHA03307   181 ETARAPSSPPAEPPPSTPPAAASPRPprrsSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENEcplpRP 260
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 2115 SKGLNQMNNGNGANKKVELSRMSSTKSSGSESDRSERPVLVRQStfiKEAPSPTLRRKLEESASFESLSPSSRPASPTRS 2194
Cdd:PHA03307   261 APITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPG---SGPAPSSPRASSSSSSSRESSSSSTSSSSESSR 337
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1237937751 2195 QAQTPVLSPSLPDMSLSTHSSVQAGGWRKLPPNLSPTIEY---NDGRP---AKRHDIARSH--SESPSRLPINRS 2261
Cdd:PHA03307   338 GAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSpaaSAGRPtrrRARAAVAGRArrRDATGRFPAGRP 412
ARM smart00185
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form ...
229-270 4.73e-04

Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.


Pssm-ID: 214547 [Multi-domain]  Cd Length: 41  Bit Score: 39.72  E-value: 4.73e-04
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|..
gi 1237937751   229 DVANKATLCSMkGCMRALVAQLKSESEDLQQVIASVLRNLSW 270
Cdd:smart00185    1 DDENKQAVVDA-GGLPALVELLKSEDEEVVKEAAWALSNLSS 41
Arm pfam00514
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ...
229-269 5.44e-04

Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.


Pssm-ID: 425727 [Multi-domain]  Cd Length: 41  Bit Score: 39.36  E-value: 5.44e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 1237937751  229 DVANKATLCSMkGCMRALVAQLKSESEDLQQVIASVLRNLS 269
Cdd:pfam00514    1 SPENKQAVIEA-GAVPPLVRLLSSPDEEVQEEAAWALSNLA 40
Arm pfam00514
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ...
408-448 6.30e-04

Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.


Pssm-ID: 425727 [Multi-domain]  Cd Length: 41  Bit Score: 39.36  E-value: 6.30e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 1237937751  408 NPKDQEALWDMGAVSMLKNLIHSKHKMIAMGSAAALRNLMA 448
Cdd:pfam00514    1 SPENKQAVIEAGAVPPLVRLLSSPDEEVQEEAAWALSNLAA 41
SAMP pfam05924
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis ...
1433-1454 1.14e-03

SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). This motif binds axin.


Pssm-ID: 461782  Cd Length: 22  Bit Score: 38.34  E-value: 1.14e-03
                           10        20
                   ....*....|....*....|..
gi 1237937751 1433 NKAEEGDILAECINSAMPKGKS 1454
Cdd:pfam05924    1 SPDDEDDLLQECINSAMPKKRR 22
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
1971-2231 2.18e-03

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 43.41  E-value: 2.18e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 1971 KTPASKSPSEGQTATTSPRGAKPsvkselSPVARQTSQIGGSSKAPSRSGSRDSTPSRPAQQPLSRPIQSPGRNSISPgr 2050
Cdd:pfam17823  151 RANASAAPRAAIAAASAPHAASP------APRTAASSTTAASSTTAASSAPTTAASSAPATLTPARGISTAATATGHP-- 222
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 2051 ngiSPPNKLSQLPrTSSPStASTKSSGSGKMSYTSPGRQMSQQNLTKQTGLSKNASSiPRSESASKGLNQMNNGNGANKk 2130
Cdd:pfam17823  223 ---AAGTALAAVG-NSSPA-AGTVTAAVGTVTPAALATLAAAAGTVASAAGTINMGD-PHARRLSPAKHMPSDTMARNP- 295
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 2131 veLSRMSSTKSSGSESDRSERPVLvrqSTFIKEAPSPTlRRKLEESASFESLSPSSRPASPTRSQAQTPVLSPsLPDMSL 2210
Cdd:pfam17823  296 --AAPMGAQAQGPIIQVSTDQPVH---NTAGEPTPSPS-NTTLEPNTPKSVASTNLAVVTTTKAQAKEPSASP-VPVLHT 368
                          250       260
                   ....*....|....*....|.
gi 1237937751 2211 STHSSVQAGGWRKLPPNLSPT 2231
Cdd:pfam17823  369 SMIPEVEATSPTTQPSPLLPT 389
APC_r pfam05923
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ...
1088-1110 3.92e-03

APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.


Pssm-ID: 461781  Cd Length: 24  Bit Score: 36.59  E-value: 3.92e-03
                           10        20
                   ....*....|....*....|....
gi 1237937751 1088 SPPEHY-VQETPLMFSRCTSVSSL 1110
Cdd:pfam05923    1 DSPKRYcVEGTPANFSRASSLSSL 24
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
1969-2285 5.84e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 42.08  E-value: 5.84e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 1969 PLKTPASKSPSEGQTATTSPRGAKPSVKSELSPVARQTSQiGGSSKAPSRSGsrDSTPSRPAQQPLSRPIQSPGRNSISP 2048
Cdd:PHA03307    63 DRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAR-EGSPTPPGPSS--PDPPPPTPPPASPPPSPAPDLSEMLR 139
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 2049 GRNGISPPNKLSQLPRTSSPS-TASTKSSGSGKM---------SYTSPGRQMSQQNLTKQTGLSKNASSIPRSESASKGL 2118
Cdd:PHA03307   140 PVGSPGPPPAASPPAAGASPAaVASDAASSRQAAlplsspeetARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASS 219
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 2119 NQMNNGNGANKKVELSRMSSTKSSGSESD---RSERPVLVRQSTfikeaPSPTLRRKLEESASFESLSPSSRPASPTRSQ 2195
Cdd:PHA03307   220 PAPAPGRSAADDAGASSSDSSSSESSGCGwgpENECPLPRPAPI-----TLPTRIWEASGWNGPSSRPGPASSSSSPRER 294
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 2196 AqtPVLSPSLPDMSLSTHSSVQAGGWRKLPP-NLSPTIEYNDGRPAKRHDIARSHSESPS----RLPINRSGTWKREHSK 2270
Cdd:PHA03307   295 S--PSPSPSSPGSGPAPSSPRASSSSSSSREsSSSSTSSSSESSRGAAVSPGPSPSRSPSpsrpPPPADPSSPRKRPRPS 372
                          330
                   ....*....|....*
gi 1237937751 2271 HSSSLPRVSTWRRTG 2285
Cdd:PHA03307   373 RAPSSPAASAGRPTR 387
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
1008-1418 6.00e-03

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 41.98  E-value: 6.00e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 1008 QTTQEAD---SANTLQIAEIKEKIGTRSAEDPvsEVPAVSQHPRTKSSRLQGSSLSSESARHKAVEFSSGAKSPSKSGAQ 1084
Cdd:PTZ00449   480 QFTQEIKkliKKSKKKLAPIEEEDSDKHDEPP--EGPEASGLPPKAPGDKEGEEGEHEDSKESDEPKEGGKPGETKEGEV 557
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 1085 TPKSPP--EHYVQETPLMFSRCT----SVSSLDSFESRSIASSVQ-SEPCSGMVSGIISPSDLPDSPGQTMPPSRSKTPP 1157
Cdd:PTZ00449   558 GKKPGPakEHKPSKIPTLSKKPEfpkdPKHPKDPEEPKKPKRPRSaQRPTRPKSPKLPELLDIPKSPKRPESPKSPKRPP 637
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 1158 PPPQTAQTKR-EVPK----NKAPTAEKRESGPK------QAAVNAAVQRVQvlpdadtllhfaTESTPDGFSCSSSLSAL 1226
Cdd:PTZ00449   638 PPQRPSSPERpEGPKiiksPKPPKSPKPPFDPKfkekfyDDYLDAAAKSKE------------TKTTVVLDESFESILKE 705
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 1227 SLDEPFIQKDVELRIMPPVQENDNG------NETESEQPKESNENQEKEAEKTIDSEKDLLDDSDDDDIEILEECIISAM 1300
Cdd:PTZ00449   706 TLPETPGTPFTTPRPLPPKLPRDEEfpfepiGDPDAEQPDDIEFFTPPEEERTFFHETPADTPLPDILAEEFKEEDIHAE 785
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 1301 PTKSSRKAKKPAQTASKLPPPVARKPSqlpvyklLPSQNRLQPQKHVSFTPGDDMPRVYCVE--GTPINFSTATSLSDL- 1377
Cdd:PTZ00449   786 TGEPDEAMKRPDSPSEHEDKPPGDHPS-------LPKKRHRLDGLALSTTDLESDAGRIAKDasGKIVKLKRSKSFDDLt 858
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|.
gi 1237937751 1378 TIESPPNELAAGEGVRGGAQSGEFEKRDTIPTEGRSTDEAQ 1418
Cdd:PTZ00449   859 TVEEAEEMGAEARKIVVDDDGTEADDEDTHPPEEKHKSEVR 899
ARM smart00185
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form ...
57-109 7.30e-03

Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.


Pssm-ID: 214547 [Multi-domain]  Cd Length: 41  Bit Score: 36.25  E-value: 7.30e-03
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|...
gi 1237937751    57 SQDSCISMRQSGCLPLLIQLLHgndkdsvllgnsRGSKEARARASAALHNIIH 109
Cdd:smart00185    1 DDENKQAVVDAGGLPALVELLK------------SEDEEVVKEAAWALSNLSS 41
APC_r pfam05923
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ...
974-991 7.34e-03

APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.


Pssm-ID: 461781  Cd Length: 24  Bit Score: 35.82  E-value: 7.34e-03
                           10
                   ....*....|....*...
gi 1237937751  974 ETIQTYCVEDTPICFSRC 991
Cdd:pfam05923    1 DSPKRYCVEGTPANFSRA 18
PHA03247 PHA03247
large tegument protein UL36; Provisional
1967-2257 7.53e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 41.85  E-value: 7.53e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 1967 GPPLKTPASKSPSEGQTATTSPRGAKPSvkselsPVARQTSqIGGSSKAPSRSGSRDSTPSRPAQQPLSRPIQSPGRNSI 2046
Cdd:PHA03247  2603 DDRGDPRGPAPPSPLPPDTHAPDPPPPS------PSPAANE-PDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQ 2675
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 2047 SPgrngiSPPNKLSQ--LPRTSSPSTASTKSSGSGKMSYTSPGRQMSQQNLTKQTGLSKNASSIPRSESASKGLNQMNNG 2124
Cdd:PHA03247  2676 AS-----SPPQRPRRraARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPAT 2750
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1237937751 2125 NGANKKVE--------LSRMSSTKSSGSESDRSERPVLVRQSTFIKEAPSPT--LRRKLEESASFESLSPSSRPAS---- 2190
Cdd:PHA03247  2751 PGGPARPArppttagpPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWdpADPPAAVLAPAAALPPAASPAGplpp 2830
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1237937751 2191 PTRSQAQTPVLSPSLPDMSLSTHSSVQAGG--WRKLPPNLSPTIEYNDGRPAKRHDIARSHSESPSRLP 2257
Cdd:PHA03247  2831 PTSAQPTAPPPPPGPPPPSLPLGGSVAPGGdvRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFA 2899
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH