Conserved domains on  [gi|30984464|ref|NP_851896|]

large tegument protein [Macacine alphaherpesvirus 1]

Protein Classification

PHA03247 family protein (domain architecture ID 11476190)

PHA03247 family protein

Graphical summary

[Interactive domain graphic omitted]

List of domain hits

Name      Accession  Description                               Interval  E-value
PHA03247  PHA03247   large tegument protein UL36; Provisional  1-3288    0e+00

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 4248.23  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464     1 MAQRGDRGIVVTGARNQFAPDLEPGGAVSCMRSSLSFLSLVFDAGLRDALSAEAVDGCLVEGGAWTRASAGSDPPRMCSA 80
Cdd:PHA03247    1 MARRGDRGIVVTGARNQFAPDLEPGGAVSCMRSSLSFLSLVFDAGLRDALSAEAVDGCLVEGGAWTRASAGSGPPRMCSI 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464    81 VELPTFLEYPSGRGLRCVFSRLYGEVAFFGKPAPGLLETQCPAHDFFAGPWARRPLSYTLVTIGALGMGLYRDGDEAYLF 160
Cdd:PHA03247   81 VELPTFLEYPSGRGLRCVFSRVYGEVAFFGEPAPGLLETQCPAHDFFAGPWARRPLSYTLVTIGALGMGLYRDGDTAYLF 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   161 DPHGLREGSPAFVAKIRAGEVYTYLTYYTQEHPEARWAGAMVFFVPAGPGPPATAALTAAVLQLYGASETYLQDEPFVER 240
Cdd:PHA03247  161 DPHGLREGSPAFVAKVRAGEVYTYLTYYTQDHPEARWAGAMVFFVPSGPGPAAPADLTAAALHLYGASETYLQDEPFVER 240
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   241 RVVVSHPLRGDDVAPAGAVAVGEEAGRAPAAARKAPPQTPPPKAAAPEPAGAADAGVWGAA---LAGAPLALPAPAPSDS 317
Cdd:PHA03247  241 RVVISHPLRGDIAAPAPPPVVGEGADRAPETARGATGPPPPPEAAAPNGAAAPPDGVWGAAlagAPLALPAPPDPPPPAP 320
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   318 AGGDDAEDDEDGAMEVVSPLPRPNQHYPLGFSKRRRPTWTPPSSLEDLSAGRHHPKRASLPTRTRRSARHAATPFSRGSG 397
Cdd:PHA03247  321 AGDAEEEDDEDGAMEVVSPLPRPRQHYPLGFPKRRRPTWTPPSSLEDLSAGRHHPKRASLPTRKRRSARHAATPFARGPG 400
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   398 GDEQTRPaagpRPPTPASrpptpgapptpgapptpgapptpgapptpagpttassepptpagpttassepptpagpttas 477
Cdd:PHA03247  401 GDDQTRP----AAPVPAS-------------------------------------------------------------- 414
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   478 sePPTPaGRPPTPAGRPPTPANPTASSEPPTPAGRPPTPAGRPPTPANPtassepptpnpegapapssneqpPAAASTDE 557
Cdd:PHA03247  415 --VPTP-APTPVPASAPPPPATPLPSAEPGSDDGPAPPPERQPPAPATE-----------------------PAPDDPDD 468
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   558 ATQKALDALRDRQPPEPPCGSLTELLGRHPDTDGGVSRLAAHEAGIAREVTECSRLTINALRSPFPGSPGLLQHCIVFLF 637
Cdd:PHA03247  469 ATRKALDALRERRPPEPPGADLAELLGRHPDTAGTVVRLAAREAAIAREVAECSRLTINALRSPFPASPGLLQHCVIFLF 548
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   638 ERVLAFLIENGARTHAGAGAEGPASGLLDLTVSLLPRRTAVGDFLASTRMTLADVAAHLPLIQPVLDEGSIVGRLALAKL 717
Cdd:PHA03247  549 ERVLAFLIENGARTHAGAGAEGPAAALLDLTLSLLPRRTAVGDFLASTRMTLADVAAHLPLIQPVLDEGSGVGRLALAKL 628
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   718 VLVARDVIRTTDDFHGELAELERRLRATPPTEVYARLSEWLLERSKAGPDTLFAPATPTHPEPLLQRIQALAGFARREEV 797
Cdd:PHA03247  629 VLVARDVIRETDAFHGELAELERQLRATPPAEVYARLSEWLLERSRAGPDTLFAPATPTHPEPLLQRVQALAGFARGEEI 708
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   798 RAAAEDREVRGALDALARGVDAVARRSGPLTVAAVSPEEPGEGGGRPHPLSPEAIRVRLEQLRADGQKAVEGATREYFHR 877
Cdd:PHA03247  709 RAAAEDREVRGALDALARGVDAVARRAGPLTVAPAPAAPGGQGAPRPPPLSPEAIRVRLEQVRAQGQRAIEGAVKEYFHR 788
                         890       900       910       920       930       940       950       960
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   878 GAVYSAKALLAGDARDRRYHVASAPVVPVVQLLESLPAFDAHVQEVARRARVPAPPPLATSPPAELLRELVQRGRDLEAP 957
Cdd:PHA03247  789 GAVYSAKALLAGDARDRRFHVASAPVVPVVQLLESLPAFDAHVREVAQRARVPAPPPLATSPQAILLRELLQRGQDLEAP 868
                         970       980       990      1000      1010      1020      1030      1040
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   958 ADLAAWLASLGDAAGQGLVVRKELDELAQAIYKINERTVRRSSGLAELERFEALDAALRGELESEAAFEPGGGDGAAAGG 1037
Cdd:PHA03247  869 ADLAAWLASLGDAAGQGLVERKELDELARAIHKINERQVRRSSGLAELERFEALDAALRQELESEAAFVPAPGAAPYADA 948
                        1050      1060      1070      1080      1090      1100      1110      1120
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464  1038 --LPAETRRLAEDALHQAKAMAAAKLTDELSPEARERLAARVRAIEAMLEEARARAEAAKAALARFFQKLQGVLRPLPDF 1115
Cdd:PHA03247  949 ggLSPETRRLAEDALRQAKAMAAAKLTDELSPEARERLRARARAIEAMLEEARERAEAARAARERFFQKLQGVLRPLPDF 1028
                        1130      1140      1150      1160      1170      1180      1190      1200
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464  1116 GGLRVAPAVLATLRADIPGGWTCLPDAAQAAPPEVRAALRADLWGLLGQYRGALEHPTADTAAALSGLHPNFAEVLRDLF 1195
Cdd:PHA03247 1029 GGLRAAPAVLATLRADLPGGWTDLPDAAQAAPPEVRAALRADLWGLLGQYREALEHPTPDTATALSGLHPNFVEVLRDLF 1108
                        1210      1220      1230      1240      1250      1260      1270      1280
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464  1196 PAAPETPLLVSFFSDHAPRVAQAVSEAIAAGSAAVATASPESTVEAAVRAQGVLADTVAALSPAVRDPACPLAFLVALAD 1275
Cdd:PHA03247 1109 PAAPETPLLVQFFSDHAPRIARAVSEAINAGSAAVATASPESTVDAAVRAHGVLADAVAALSPAVRDPACPLAFLVALAD 1188
                        1290      1300      1310      1320      1330      1340      1350      1360
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464  1276 SAAGYVKATRLALGARRAIARLGALGAAAADLAVAVRRENPQADGDRAALLEAAARAVEAARAGLAACEGEFGGLLHAEG 1355
Cdd:PHA03247 1189 SAAGYVKATRLALDARRAIARLGALGAAAADLAVAVRRENPQAEGDRAALLEAAARAVTAAREGLAACEGEFGGLLHAEG 1268
                        1370      1380      1390      1400      1410      1420      1430      1440
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464  1356 SAGDPSPSGRALQELGKVVAATRRRADELEAAAADLAEKLADRDARAGRERWAADVEAALDRVENRAEFDAVELRRLQAL 1435
Cdd:PHA03247 1269 SAGDPSPSGRALQELGKVVGATRRRADELEAAAADLAEKMAARRARASRERWAADVEAALDRVENRAEFDAVELRRLQAL 1348
                        1450      1460      1470      1480      1490      1500      1510      1520
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464  1436 AAQNKYNPRDFRKRAEQALAANAKTATLALEAAFAFNPYTPENQHHPALPPLAAARRIDWGPAFGAAAETYAEMFRVDTE 1515
Cdd:PHA03247 1349 AATHGYNPRDFRKRAEQALAANAKTATLALEAAFAFNPYTPENQRHPMLPPLAAIHRIDWGPAFGAAAETYAEMFRVDTE 1428
                        1530      1540      1550      1560      1570      1580      1590      1600
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464  1516 PLARLLRITGGLLDLAQAGGGFIDYHEAVSRLAEDLNGVPSLRHYVPFFRRGHAEYLELCDRLDALRADVHRALGGVPLD 1595
Cdd:PHA03247 1429 PLARLLRLAGGLLELAQAGGGFIDYHEAVSRLAEDLNGVPSLRRYVPFFRRGHAEYLELCDRLDALRADVHRALGGVPLD 1508
                        1610      1620      1630      1640      1650      1660      1670      1680
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464  1596 LAAAAEQTVRLRGDPAAAAELVRTGVTLACPSEDALAACVGALERVDQAPVKDTAYAEHVAFVARRDLGEAKDALVRAKQ 1675
Cdd:PHA03247 1509 LAAAAEQTSRLRNDPAAAAELVRTGVTLACPSEDALAACVGALERVDQSPVKDTAYAEYVAFVARRDLAEAKDALVRAKQ 1588
                        1690      1700      1710      1720      1730      1740      1750      1760
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464  1676 QRAEATDRVTAALREALAAHERQARSEAESLANLKTLLRVAAIPAAAAKTLEQARSVAEIVDQIELLLEQTEKAAELDQA 1755
Cdd:PHA03247 1589 QRAEATDRVTAALREALAAHERRAQSEAESLANLKTLLRVAAIPATAAKTLDQARSVAEIVDQIELLLEQTEKAAELDVA 1668
                        1770      1780      1790      1800      1810      1820      1830      1840
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464  1756 AVDWLEHARRVFEAHPLTAARDGSPDPLARLHARLDALGETRRRTAALRRSLEAAEAEWDEVWARFGRARGGAWKSPEAL 1835
Cdd:PHA03247 1669 AVDWLEHARRVFEAHPLTAARGGGPDPLARLHARLDALGETRRRTEALRRSLEAAEAEWDEVWGRFGRVRGGAWKSPEAL 1748
                        1850      1860      1870      1880      1890      1900      1910      1920
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464  1836 GAAREQLRALQTATNTVLGLVADAHYPRLPAKYQGAIGAKSAERAGAVEELGVAVERHDGLLARLREDVVARVPWEMNAD 1915
Cdd:PHA03247 1749 RAAREQLRALQTATNTVLGLRADAHYERLPAKYQGALGAKSAERAGAVEELGAAVARHDGLLARLREEVVARVPWEMNAD 1828
                        1930      1940      1950      1960      1970      1980      1990      2000
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464  1916 ALGRLLAEFDALAEDLTPWAVDEFRGARALVQHRLGLYSAYAKARAQtgAGGTPPPAPAPLLVDVRALEARARSPGERHE 1995
Cdd:PHA03247 1829 ALGRLLAEFDALAGDLAPWAVDEFRGARALVQHRLGLYSAYAKARAQ--TGAGGAAPPAPLLVDLRALEARARSPPERHE 1906
                        2010      2020      2030      2040      2050      2060      2070      2080
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464  1996 PDPRTVRGRGEAYLRARGDPGPLELREATSDLDLPFATSYLTPDGTPLQYVVCFPAVTDKLGALLMRAEAARARPPLPPE 2075
Cdd:PHA03247 1907 PDPRMVRRRGEAYLRASGDPGPLELREATSELDLPFATSYLAPDGTPLQYALCFPAVTDKLGALLMRPEAARARPPLPTE 1986
                        2090      2100      2110      2120      2130      2140      2150      2160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464  2076 GLDSTQTLAAMCTVPLITQLQLALSDAQGDGFRLFGRFVRHRQPGWRDSAAAAAELYAALTATTLTREFGCRWDELGWER 2155
Cdd:PHA03247 1987 GLESTQTLAAMCTVPLITRLQLALSDAQGAGFRLFGRFVRHRQPAWRDSMAAAAELYAALVATTLTREFGCRWDELGWAR 2066
                        2170      2180      2190      2200      2210      2220      2230      2240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464  2156 GAPAPAPLAEPAGTRRPRVTFNMNDVMVALVAGTPEHIYNFWRLDLVLQHEYMHLTLPAAWETGAGAILFVQRLTPHPSP 2235
Cdd:PHA03247 2067 GAAAPAPLAEPAGSRRPRVTFNENDVMVALVAGTPEHIYNFWRLDLVLQHEYMHLTLPAAWETGAGSILFVQRLTPHPDP 2146
                        2250      2260      2270      2280      2290      2300      2310      2320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464  2236 EVRVLPAVAAGPPPATGLLFGTRLADWRHGKLSSSDPLAPWRAAPELAAGGAAAALGGLGGPRALVAVSVLGRMCLPSAA 2315
Cdd:PHA03247 2147 EVRVLPAVPAGPPPATGLLFGTRLADWRHGKLSESDPLAPWRAAPELAAGGAAAALGGLSGPRALVAVSVLGRMCLPSAA 2226
                        2330      2340      2350      2360      2370      2380      2390      2400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464  2316 LAALWSCMFPEGYSEYDSLDALLAARLGAGRPPDPQGGREDVPAPPAHALYRPSGQRVLVRGGAPDPAARVTVMDLVLAA 2395
Cdd:PHA03247 2227 LAALWSCMFPDGYSEYDSLDALLAARLGSGRTPDPQGGREDSPAPPAHALYRPSGQRVLVRGGAPDPAARVTVMDLVLAA 2306
                        2410      2420      2430      2440      2450      2460      2470      2480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464  2396 TLLGAPVVVALRSDPAFSKGSELELCVTLFDSRERGADAALREVVSSDVETWATDLLHADLNAIENACLAAQLPALSALI 2475
Cdd:PHA03247 2307 TLLGAPVVVALRSDPAFSRGSELELCVTLFDSRARGADAALREVVSSDVETWAVDLLHADLNPIENACLAAQLPALSALI 2386
                        2490      2500      2510      2520      2530      2540      2550      2560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464  2476 AARPLAGSPPCLVLVDISMAPLFVLWEQPDPPGPPDVRFVGSDEIEELPFVSPGADVLAGLAAEGDPFFARTILGAPFSL 2555
Cdd:PHA03247 2387 AARPLARSPPCLVLVDISMAPLFVLWEQPDPPGPPDVRFVGSEEIEELPFVSPGGDVLAGLAADGDPFFARTILGAPFSL 2466
                        2570      2580      2590      2600      2610      2620      2630      2640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464  2556 SRLLHETFPGAPVHRRPPEARFPFAAGAGPDPGGGAPPDPEAPPPPSHQTPAILPDEPVGETVHPRMLTWIRGLEELASD 2635
Cdd:PHA03247 2467 SLLLGELFPGAPVYRRPAEARFPFAAGAAPDPGGGGPPDPDAPPAPSRLAPAILPDEPVGEPVHPRMLTWIRGLEELASD 2546
                        2650      2660      2670      2680      2690      2700      2710      2720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464  2636 DAGDPPPPLPPAARPAAPDRSVPPPRPAPRPSEPAVQSRARRPDAPPQSARPRTPRDDPGPPAApsTLPPSPPPPAPHPP 2715
Cdd:PHA03247 2547 DAGDPPPPLPPAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRG--PAPPSPLPPDTHAP 2624
                        2730      2740      2750      2760      2770      2780      2790      2800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464  2716 DPPPPDPSPRPNELGGGETaaapsapdppppqtVPAPERPRDDRAPAPGRVSLPRRASRQGRPA-PSSPLQRPRRRAARP 2794
Cdd:PHA03247 2625 DPPPPSPSPAANEPDPHPP--------------PTVPPPERPRDDPAPGRVSRPRRARRLGRAAqASSPPQRPRRRAARP 2690
                        2810      2820      2830      2840      2850      2860      2870      2880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464  2795 AVGSLTNLADPHPPGPETPTptsptanphattasatpapppvaatgPTRPTSPATPTPTVAAAGRASappapaapAAPAA 2874
Cdd:PHA03247 2691 TVGSLTSLADPPPPPPTPEP--------------------------APHALVSATPLPPGPAAARQA--------SPALP 2736
                        2890      2900      2910      2920      2930      2940      2950      2960
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464  2875 PAAPAAPAAPAAPAAPAAPAAPAAPAAPAAPAAPAAPAAPAAPAAPAAPAAPAAPAAPAPAAVPVVVPAPTATLTATTTT 2954
Cdd:PHA03247 2737 AAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAA 2816
                        2970      2980      2990      3000      3010      3020      3030      3040
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464  2955 SPAPAATPASPVPTPTSSLPTPPSKPPAFFQPSLATGGSVAPGGDFRRRAPSRPTAAVPAAPSRPPARRLARPAVSRSTE 3034
Cdd:PHA03247 2817 ALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTE 2896
                        3050      3060      3070      3080      3090      3100      3110      3120
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464  3035 SFALPPDELARPRTPEAPAPPTETEEAPVAERPAPPEPPQGRPPSPAAPDAGPAAASGPSGGVPAPRLGALVPGRVAVPR 3114
Cdd:PHA03247 2897 SFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPR 2976
                        3130      3140      3150      3160      3170      3180      3190      3200
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464  3115 RQIPPPA-PPREIPAPSPPPPRSHAPRVSSWASSLALHEEPDAGPVSLKQTLWPPDELDDASDDSSLDSDPERLDLGSLD 3193
Cdd:PHA03247 2977 FRVPQPApSREAPASSTPPLTGHSLSRVSSWASSLALHEETDPPPVSLKQTLWPPDDTEDSDADSLFDSDSERSDLEALD 3056
                        3210      3220      3230      3240      3250      3260      3270      3280
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464  3194 PETDPETDPFAHPPDPRAPEAGDRASPSSYFGPPPLSANAALSRRYVRSTGRSALAVLIEACWRIRRQLRMTRHALLNRS 3273
Cdd:PHA03247 3057 PLPPEPHDPFAHEPDPATPEAGARESPSSQFGPPPLSANAALSRRYVRSTGRSALAVLIEACRRIRRQLRRTRHALLDRS 3136
                        3290
                  ....*....|....*
gi 30984464  3274 GAVLTGLYHVRMLLG 3288
Cdd:PHA03247 3137 GAVLTGLYHVRMLLG 3151
 
gi 30984464  3194 PETDPETDPFAHPPDPRAPEAGDRASPSSYFGPPPLSANAALSRRYVRSTGRSALAVLIEACWRIRRQLRMTRHALLNRS 3273
Cdd:PHA03247 3057 PLPPEPHDPFAHEPDPATPEAGARESPSSQFGPPPLSANAALSRRYVRSTGRSALAVLIEACRRIRRQLRRTRHALLDRS 3136
                        3290
                  ....*....|....*
gi 30984464  3274 GAVLTGLYHVRMLLG 3288
Cdd:PHA03247 3137 GAVLTGLYHVRMLLG 3151
Herpes_UL36 pfam03586
Herpesvirus UL36 tegument protein; The UL36 open reading frame (ORF) encodes the largest ...
1405-1654 1.55e-92

Herpesvirus UL36 tegument protein; The UL36 open reading frame (ORF) encodes the largest herpes simplex virus type 1 (HSV-1) protein, a 270-kDa polypeptide designated VP1/2, which is also a component of the virion tegument. A null mutation in the UL36 gene of herpes simplex virus type 1 results in accumulation of unenveloped DNA-filled capsids in the cytoplasm of infected cells. This family only covers a small central part of this large protein.


Pssm-ID: 427378 [Multi-domain]  Cd Length: 251  Bit Score: 301.01  E-value: 1.55e-92
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   1405 ERWAADVEAALDRVENRAEFDAVELRRLQALAAQNKYNPRDFRKRAEQALAANAKTATLALEAAFAFNPYTPENQHhpAL 1484
Cdd:pfam03586    3 ERWRSDIRALLERDEDDGEFDLDELDRLRDEAETGGYDAVDLIKRARQVVDARAQLGLRALETVFAFNPYTPQNSQ--AL 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   1485 PPLAAARRIDWGPAFGAAAETYAEMFRVDTEPLARLLRITGGLLDLAQAG-GGFIDYHEAVSRLAEDLNGVPSLRHYVPF 1563
Cdd:pfam03586   81 PPLALLESITWIDAFPGAADTYTYLFGVSVEKLKALLRIGEEILEAADAAnDGNIDYHAFVLTLSGDLFQVPALTEYVDF 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   1564 FRRGHAEYLELCDRLDALRADVHRALGGVPLDLAAAAEQTVRLRGDPAAAAELVRTGVTLACPSEDALAACVGALERVDQ 1643
Cdd:pfam03586  161 YVRSYELFLDIRAALAELRADARGALGRVALEQLAALEEAAEVRRDPEAAKEALERGVRITLPSEDALTAMREGLKLEDK 240
                          250
                   ....*....|.
gi 30984464   1644 APVKDTAYAEH 1654
Cdd:pfam03586  241 KQFEGTAYLEY 251
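The paired rows above can be compared column by column to get a rough identity figure for a hit. The following is a minimal sketch, not part of the CD-Search output: the function name `aligned_identity` is hypothetical, gap columns (`-`) are skipped, and letters are compared case-insensitively (treating lowercase residues the same as uppercase is an assumption of this sketch). The two strings are copied from the first block of the pfam03586 alignment.

```python
def aligned_identity(query: str, subject: str) -> float:
    """Fraction of identical residues over columns where neither row has a gap."""
    if len(query) != len(subject):
        raise ValueError("aligned rows must have equal length")
    compared = 0
    matches = 0
    for q, s in zip(query, subject):
        if q == "-" or s == "-":
            continue  # gap column: not counted in the denominator
        compared += 1
        if q.upper() == s.upper():
            matches += 1
    return matches / compared if compared else 0.0


# First block of the pfam03586 alignment (query residues 1405-1484).
query_row = "ERWAADVEAALDRVENRAEFDAVELRRLQALAAQNKYNPRDFRKRAEQALAANAKTATLALEAAFAFNPYTPENQHhpAL"
cdd_row = "ERWRSDIRALLERDEDDGEFDLDELDRLRDEAETGGYDAVDLIKRARQVVDARAQLGLRALETVFAFNPYTPQNSQ--AL"

print(f"identity over aligned columns: {aligned_identity(query_row, cdd_row):.1%}")
```

Concatenating all blocks of a hit before calling the function gives the figure for the whole interval rather than for one 80-column block.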
PksD COG3321
Acyl transferase domain in polyketide synthase (PKS) enzymes [Secondary metabolites ...
1474-2013 5.02e-11

Acyl transferase domain in polyketide synthase (PKS) enzymes [Secondary metabolites biosynthesis, transport and catabolism];


Pssm-ID: 442550 [Multi-domain]  Cd Length: 1386  Bit Score: 69.13  E-value: 5.02e-11
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464 1474 YTPENQHHPALPPLAAARRIDWGPAFGAAAEtyaemfrvdTEPLARLLRITGGLLDLAQAGGGFIDYHEAVSRLAEDLNG 1553
Cdd:COG3321  853 YPGRGRRRVPLPTYPFQREDAAAALLAAALA---------AALAAAAALGALLLAALAAALAAALLALAAAAAAALALAA 923
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464 1554 VPSLRHYVPFFRRGHAEYLELCDRLDALRADVHRALGGVPLDLAAAAEQTVRLRGDPAAAAELVRTGVTLACPSEDALAA 1633
Cdd:COG3321  924 AALAALLALVALAAAAAALLALAAAAAAAAAALAAAEAGALLLLAAAAAAAAAAAAAAAAAAAAAAAAAAAALAAAAALA 1003
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464 1634 CVGALERVDQAPVKDTAYAEHVAFVARRDLGEAKDALVRAKQQRAEATDRVTAALREALAAHERQARSEAESLANLKTLL 1713
Cdd:COG3321 1004 LLAAAALLLAAAAAAAALLALAALLAAAAAALAAAAAAAAAAAALAALAAAAAAAAALALALAALLLLAALAELALAAAA 1083
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464 1714 RVAAIPAAAAkTLEQARSVAEIVDQIELLLEQTEKAAELDQAAVDWLEHARRVFEAHPLTAARDGSPDPLARLHARLDAL 1793
Cdd:COG3321 1084 LALAAALAAA-ALALALAALAAALLLLALLAALALAAAAAALLALAALLAAAAAAAALAAAAAAAAALALAAAAAALAAA 1162
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464 1794 GETRRRTAALRRSLEAAEAEWDEVWARFGRARGGAWKSPEALGAAREQLRALQTATNTVLGLVADAHYPRLPAKYQGAIG 1873
Cdd:COG3321 1163 LAAALLAAAALLLALALALAAALAAALAGLAALLLAALLAALLAALLALALAALAAAAAALLAAAAAAAALALLALAAAA 1242
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464 1874 AKSAERAGAVEELGVAVERHDGLLARLREDVVARVPWEMNADALGRLLAEFDALAEDLTPWAVDEFRGARALVQHRLGLY 1953
Cdd:COG3321 1243 AAVAALAAAAAALLAALAALALLAAAAGLAALAAAAAAAAAALALAAAAAAAAAALAALLAAAAAAAAAAAAAAAAAALA 1322
                        490       500       510       520       530       540
                 ....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464 1954 SAYAKARAQTGAGGTPPPAPAPLLVDVRALEARARSPGERHEPDPRTVRGRGEAYLRARG 2013
Cdd:COG3321 1323 AALLAAALAALAAAVAAALALAAAAAAAAAAAAAAAAAAALAAAAGAAAAAAALALAALA 1382
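Each hit in this listing carries a one-line summary of the form `Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...`. A minimal sketch of pulling those fields into a typed record is shown below; the names `HitSummary`, `SUMMARY_RE` and `parse_summary` are hypothetical, and the pattern assumes the layout seen in this listing rather than any documented format.

```python
import re
from typing import NamedTuple


class HitSummary(NamedTuple):
    pssm_id: int
    cd_length: int
    bit_score: float
    e_value: float


# Layout assumed from the summary lines in this listing; the bracketed
# tag (e.g. "[Multi-domain]") is treated as optional.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(\d+)\s*(?:\[[^\]]*\])?\s*"
    r"Cd Length:\s*(\d+)\s*"
    r"Bit Score:\s*([\d.]+)\s*"
    r"E-value:\s*(\S+)"
)


def parse_summary(line: str) -> HitSummary:
    """Parse one 'Pssm-ID: ...' summary line into a HitSummary."""
    m = SUMMARY_RE.search(line)
    if m is None:
        raise ValueError(f"not a hit summary line: {line!r}")
    return HitSummary(
        pssm_id=int(m.group(1)),
        cd_length=int(m.group(2)),
        bit_score=float(m.group(3)),
        e_value=float(m.group(4)),  # float() accepts forms like '5.02e-11' and '0e+00'
    )


line = "Pssm-ID: 442550 [Multi-domain]  Cd Length: 1386  Bit Score: 69.13  E-value: 5.02e-11"
print(parse_summary(line))
```

Collecting one record per hit makes it straightforward to sort the hits by E-value or to filter out the weaker matches.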
Streccoc_I_II NF033804
antigen I/II family LPXTG-anchored adhesin; Members of the antigen I/II family are adhesins ...
453-552 2.87e-06

antigen I/II family LPXTG-anchored adhesin; Members of the antigen I/II family are adhesins with a glucan-binding domain, two types of repetitive regions, an isopeptide bond-forming domain associated with shear resistance, and a C-terminal LPXTG motif for anchoring to the cell wall. They occur in oral streptococci and tend to be major cell surface adhesins. Members of this family include SspA and SspB from Streptococcus gordonii and antigen I/II from S. mutans.


Pssm-ID: 468188 [Multi-domain]  Cd Length: 1552  Bit Score: 53.41  E-value: 2.87e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   453 EPPTPAGPTTASSEPPTPAGPTTASSEPPTPAGRPPTPAgRPPTPANPTASSEP---PTPAG-------RPPTPAGRPPT 522
Cdd:NF033804  876 EPSKPEEPTYETEKPLEPAPVAPTYENEPTPPVKTPDQP-EPSKPEEPTYETEKplePAPVApsyenepTPPVKTPDQPE 954
                          90       100       110
                  ....*....|....*....|....*....|
gi 30984464   523 PANPTASSEPPTPNPEGAPAPSSNEQPPAA 552
Cdd:NF033804  955 PSKPVEPTYDPLPTPPVAPTPKQLPTPPAV 984
SAV_2336_NTERM NF041121
SAV_2336 family N-terminal domain; This HMM describes an N-terminal domain shared by SAV_2336 ...
476-572 1.76e-04

SAV_2336 family N-terminal domain; This HMM describes an N-terminal domain shared by SAV_2336 (BAC70047.1), whose C-terminal region suggests restriction enzyme activity (PMID:18456708), and by other proteins with unrelated C-terminal regions. A member protein was also identified in a kanamycin biosynthetic gene cluster (PMID:16766657), while N-terminal regions of two other member proteins were named Trypco1 in a bioinformatic study (PMID:32101166) of predicted bacterial conflict systems.


Pssm-ID: 469044 [Multi-domain]  Cd Length: 473  Bit Score: 46.92  E-value: 1.76e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   476 ASSEPPTPAGRPPTPAGRPPTPANPTASSEPPTPAGRPPTPAGRPPTPANPTASSEPPTPNPEGAPAPSSNEQP-----P 550
Cdd:NF041121   17 RAAAPPSPEGPAPTAASQPATPPPPAAPPSPPGDPPEPPAPEPAPLPAPYPGSLAPPPPPPPGPAGAAPGAALPvrvpaP 96
                          90       100
                  ....*....|....*....|..
gi 30984464   551 AAASTDEATQKALDALRDRQPP 572
Cdd:NF041121   97 PALPNPLELARALRPLKRRVPS 118
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
454-545 5.53e-04

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 45.53  E-value: 5.53e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   454 PPTPAGPTTassepPTPAGPTTASSEPPTPAGRPPTPagrPPTPANPTASSEPPTPAgrpPTPAGRPPTPaNPTASSEPP 533
Cdd:NF033839  281 QDTPKEPGN-----KKPSAPKPGMQPSPQPEKKEVKP---EPETPKPEVKPQLEKPK---PEVKPQPEKP-KPEVKPQLE 348
                          90
                  ....*....|..
gi 30984464   534 TPNPEGAPAPSS 545
Cdd:NF033839  349 TPKPEVKPQPEK 360
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
455-543 1.85e-03

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 43.99  E-value: 1.85e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   455 PTPAGPTTA-SSEPPTPAGPTTASSEPPTPAGRPPTPAGRPPTPANPtassEPPTPAGRP----PTPAGRP-PTPANPTA 528
Cdd:NF033839  290 KKPSAPKPGmQPSPQPEKKEVKPEPETPKPEVKPQLEKPKPEVKPQP----EKPKPEVKPqletPKPEVKPqPEKPKPEV 365
                          90
                  ....*....|....*
gi 30984464   529 SSEPPTPNPEGAPAP 543
Cdd:NF033839  366 KPQPEKPKPEVKPQP 380
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
453-574 1.95e-03

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 43.99  E-value: 1.95e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   453 EPPTPAGPTTASSEPPTPagptTASSEPPTPAGRP-PTPAGR----PPTPANPTASSEP----PTPAGRP----PTPAGR 519
Cdd:NF033839  302 SPQPEKKEVKPEPETPKP----EVKPQLEKPKPEVkPQPEKPkpevKPQLETPKPEVKPqpekPKPEVKPqpekPKPEVK 377
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 30984464   520 P-PTPANPTASSEPPTPNPEGAPAPssnEQPPAAASTDEATQKAlDALRDRQPPEP 574
Cdd:NF033839  378 PqPETPKPEVKPQPEKPKPEVKPQP---EKPKPEVKPQPEKPKP-EVKPQPEKPKP 429
KREPA2 cd23959
Kinetoplastid RNA Editing Protein A2 (KREPA2); The KREPA2 (TbMP63) protein is a component of ...
453-576 2.42e-03

Kinetoplastid RNA Editing Protein A2 (KREPA2); The KREPA2 (TbMP63) protein is a component of the parasitic protozoan's KREPA RNA editing catalytic complex (RECC). Kinetoplastid RNA editing (KRE) proteins occur as pairs or sets of related proteins in multiple complexes. The KREPA complex is composed of six components (KREPA1-6), which share a conserved C-terminal region containing an oligonucleotide-binding (OB)-fold-like domain. KREPAs are responsible for the site-specific insertion and deletion of U nucleotides in kinetoplastid mitochondrial pre-messenger RNA. Apart from the conserved C-terminal OB-fold domain, KREPA1, KREPA2, and KREPA3 contain two conserved C2H2 zinc-finger domains. KREPA2 and kinetoplastid RNA editing ligase 1 (KREL1) are specific for ligation post-U-deletion and are paralogous to KREL2 and KREPA1, which are specific for ligation post-U-insertion. KREPA2 is critical for RECC stability and KREL1 integration into the complex.


Pssm-ID: 467780 [Multi-domain]  Cd Length: 424  Bit Score: 43.32  E-value: 2.42e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464  453 EPPTPAGPTTASSEPPTPagPTTASSEPPTPaGRPPTPAGRPPTPANPTASSEPPTPAGRPPTPAGRPPTPANPTASSEP 532
Cdd:cd23959  128 ETHKTAQVAPPKAEPQTA--PVTPFGQLPMF-GQHPPPAKPLPAAAAAQQSSASPGEVASPFASGTVSASPFATATDTAP 204
                         90       100       110       120
                 ....*....|....*....|....*....|....*....|....
gi 30984464  533 PTPNPEGAPAPSSNEQPPAAASTDEATQKALDALRDRQPPEPPC 576
Cdd:cd23959  205 SSGAPDGFPAEASAPSPFAAPASAASFPAAPVANGEAATPTHAC 248
rad23 TIGR00601
UV excision repair protein Rad23; All proteins in this family for which functions are known ...
474-546 2.74e-03

UV excision repair protein Rad23; All proteins in this family for which functions are known are components of a multiprotein complex used for targeting nucleotide excision repair to specific parts of the genome. In humans, Rad23 complexes with the XPC protein. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 273167 [Multi-domain]  Cd Length: 378  Bit Score: 42.96  E-value: 2.74e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 30984464    474 TTASSEPPTPAGRPPTPAgrPPTPANPTASSEPPTPAGRPPtpaGRPPTPANPTASSEPPTPN-PEGAPAPSSN 546
Cdd:TIGR00601   81 TGKVAPPAATPTSAPTPT--PSPPASPASGMSAAPASAVEE---KSPSEESATATAPESPSTSvPSSGSDAAST 149
SAV_2336_NTERM NF041121
SAV_2336 family N-terminal domain; This HMM describes an N-terminal domain shared by SAV_2336 ...
494-575 2.97e-03

SAV_2336 family N-terminal domain; This HMM describes an N-terminal domain shared by SAV_2336 (BAC70047.1), whose C-terminal region suggests restriction enzyme activity (PMID:18456708), and by other proteins with unrelated C-terminal regions. A member protein was also identified in a kanamycin biosynthetic gene cluster (PMID:16766657), while N-terminal regions of two other member proteins were named Trypco1 in a bioinformatic study (PMID:32101166) of predicted bacterial conflict systems.


Pssm-ID: 469044 [Multi-domain]  Cd Length: 473  Bit Score: 43.07  E-value: 2.97e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   494 PPTPANPTASSEPPTPAGRPPTPAGRPPTPANPTASSEPPTPNPEGAPAPSSNEQPPAAASTDEATQKALDALRDRQPPE 573
Cdd:NF041121   16 GRAAAPPSPEGPAPTAASQPATPPPPAAPPSPPGDPPEPPAPEPAPLPAPYPGSLAPPPPPPPGPAGAAPGAALPVRVPA 95

                  ..
gi 30984464   574 PP 575
Cdd:NF041121   96 PP 97
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
390-595 3.19e-03

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 43.22  E-value: 3.19e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   390 TPFSRGSGGDEQTRPAAGPRPPTPASRPPTPGAPPTPGAPPTPGAPPTPGAPPTPAGPTTASSEPPTPAgpTTASSEPPT 469
Cdd:NF033839  283 TPKEPGNKKPSAPKPGMQPSPQPEKKEVKPEPETPKPEVKPQLEKPKPEVKPQPEKPKPEVKPQLETPK--PEVKPQPEK 360
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   470 PAGPTTASSEPPTPAGRP----PTPAGRP-PTPANPTASSEP--PTPAGRP----PTPAGRP-PTPANPTASSEPPTPNP 537
Cdd:NF033839  361 PKPEVKPQPEKPKPEVKPqpetPKPEVKPqPEKPKPEVKPQPekPKPEVKPqpekPKPEVKPqPEKPKPEVKPQPEKPKP 440
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 30984464   538 EGAPAPSSN--EQPPAAASTDEATQKALDALRDRQPPEP----PCGSLTELLGRHPDTDGGVSR 595
Cdd:NF033839  441 EVKPQPEKPkpEVKPQPETPKPEVKPQPEKPKPEVKPQPekpkPDNSKPQADDKKPSTPNNLSK 504
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
400-603 5.46e-03

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 42.45  E-value: 5.46e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   400 EQTRPAAGPRPPTPASRPPTPGAPPTPGAPPTPGAPPTPGAPPTPAGPTTASSEPPTPAGPTTASSEPPTPagptTASSE 479
Cdd:NF033839  359 EKPKPEVKPQPEKPKPEVKPQPETPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKP----EVKPQ 434
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   480 PPTPAgrpPTPAGRPPTPaNPTASSEPPTPAgrpPTPAGRPPTPaNPTASSEPPTPNPEGAPAPSSNEQPPAAASTDEAT 559
Cdd:NF033839  435 PEKPK---PEVKPQPEKP-KPEVKPQPETPK---PEVKPQPEKP-KPEVKPQPEKPKPDNSKPQADDKKPSTPNNLSKDK 506
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....
gi 30984464   560 QKALDALRDRQPPEPPCGSLtellgrhPDTdGGVSRLAAHEAGI 603
Cdd:NF033839  507 QPSNQASTNEKATNKPKKSL-------PST-GSISNLALEIAGL 542
 
Cdd:PHA03247 2227 LAALWSCMFPDGYSEYDSLDALLAARLGSGRTPDPQGGREDSPAPPAHALYRPSGQRVLVRGGAPDPAARVTVMDLVLAA 2306
                        2410      2420      2430      2440      2450      2460      2470      2480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464  2396 TLLGAPVVVALRSDPAFSKGSELELCVTLFDSRERGADAALREVVSSDVETWATDLLHADLNAIENACLAAQLPALSALI 2475
Cdd:PHA03247 2307 TLLGAPVVVALRSDPAFSRGSELELCVTLFDSRARGADAALREVVSSDVETWAVDLLHADLNPIENACLAAQLPALSALI 2386
                        2490      2500      2510      2520      2530      2540      2550      2560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464  2476 AARPLAGSPPCLVLVDISMAPLFVLWEQPDPPGPPDVRFVGSDEIEELPFVSPGADVLAGLAAEGDPFFARTILGAPFSL 2555
Cdd:PHA03247 2387 AARPLARSPPCLVLVDISMAPLFVLWEQPDPPGPPDVRFVGSEEIEELPFVSPGGDVLAGLAADGDPFFARTILGAPFSL 2466
                        2570      2580      2590      2600      2610      2620      2630      2640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464  2556 SRLLHETFPGAPVHRRPPEARFPFAAGAGPDPGGGAPPDPEAPPPPSHQTPAILPDEPVGETVHPRMLTWIRGLEELASD 2635
Cdd:PHA03247 2467 SLLLGELFPGAPVYRRPAEARFPFAAGAAPDPGGGGPPDPDAPPAPSRLAPAILPDEPVGEPVHPRMLTWIRGLEELASD 2546
                        2650      2660      2670      2680      2690      2700      2710      2720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464  2636 DAGDPPPPLPPAARPAAPDRSVPPPRPAPRPSEPAVQSRARRPDAPPQSARPRTPRDDPGPPAApsTLPPSPPPPAPHPP 2715
Cdd:PHA03247 2547 DAGDPPPPLPPAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRG--PAPPSPLPPDTHAP 2624
                        2730      2740      2750      2760      2770      2780      2790      2800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464  2716 DPPPPDPSPRPNELGGGETaaapsapdppppqtVPAPERPRDDRAPAPGRVSLPRRASRQGRPA-PSSPLQRPRRRAARP 2794
Cdd:PHA03247 2625 DPPPPSPSPAANEPDPHPP--------------PTVPPPERPRDDPAPGRVSRPRRARRLGRAAqASSPPQRPRRRAARP 2690
                        2810      2820      2830      2840      2850      2860      2870      2880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464  2795 AVGSLTNLADPHPPGPETPTptsptanphattasatpapppvaatgPTRPTSPATPTPTVAAAGRASappapaapAAPAA 2874
Cdd:PHA03247 2691 TVGSLTSLADPPPPPPTPEP--------------------------APHALVSATPLPPGPAAARQA--------SPALP 2736
                        2890      2900      2910      2920      2930      2940      2950      2960
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464  2875 PAAPAAPAAPAAPAAPAAPAAPAAPAAPAAPAAPAAPAAPAAPAAPAAPAAPAAPAAPAPAAVPVVVPAPTATLTATTTT 2954
Cdd:PHA03247 2737 AAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAA 2816
                        2970      2980      2990      3000      3010      3020      3030      3040
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464  2955 SPAPAATPASPVPTPTSSLPTPPSKPPAFFQPSLATGGSVAPGGDFRRRAPSRPTAAVPAAPSRPPARRLARPAVSRSTE 3034
Cdd:PHA03247 2817 ALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTE 2896
                        3050      3060      3070      3080      3090      3100      3110      3120
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464  3035 SFALPPDELARPRTPEAPAPPTETEEAPVAERPAPPEPPQGRPPSPAAPDAGPAAASGPSGGVPAPRLGALVPGRVAVPR 3114
Cdd:PHA03247 2897 SFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPR 2976
                        3130      3140      3150      3160      3170      3180      3190      3200
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464  3115 RQIPPPA-PPREIPAPSPPPPRSHAPRVSSWASSLALHEEPDAGPVSLKQTLWPPDELDDASDDSSLDSDPERLDLGSLD 3193
Cdd:PHA03247 2977 FRVPQPApSREAPASSTPPLTGHSLSRVSSWASSLALHEETDPPPVSLKQTLWPPDDTEDSDADSLFDSDSERSDLEALD 3056
                        3210      3220      3230      3240      3250      3260      3270      3280
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464  3194 PETDPETDPFAHPPDPRAPEAGDRASPSSYFGPPPLSANAALSRRYVRSTGRSALAVLIEACWRIRRQLRMTRHALLNRS 3273
Cdd:PHA03247 3057 PLPPEPHDPFAHEPDPATPEAGARESPSSQFGPPPLSANAALSRRYVRSTGRSALAVLIEACRRIRRQLRRTRHALLDRS 3136
                        3290
                  ....*....|....*
gi 30984464  3274 GAVLTGLYHVRMLLG 3288
Cdd:PHA03247 3137 GAVLTGLYHVRMLLG 3151
PHA03246 PHA03246
large tegument protein UL36; Provisional
6-2539 0e+00

Pssm-ID: 223020 [Multi-domain]  Cd Length: 3095  Bit Score: 922.84  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464     6 DRGIVVTGARNQFAPDLEPGGAVSCMRSSLSFLSLVFDAGLRDALSAEAVDGCLVEGGAWTRASAGSDPPRMCSAVELPT 85
Cdd:PHA03246   73 DFTIVAVGIRNQFAPDLSPGSSVSCLRSSLAFLRIVFAYGLDTVLSADAIDRLLLQGKDWTIHTSDDGVYTTCVPHDLPN 152
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464    86 FLEYPSGRGLRCV-FSRLYGEVAFF-GKPAPGLLETQCPAHDFFAGPWA-RRPLSYTLVTIGALGMGLYRDGDEAYLFDP 162
Cdd:PHA03246  153 RILSKDPGGNLCVaFSSSYGESEFYlDENTPTILDTQISARTFIERVWKqKRGDIYCLIVIGVLGIGVYRSEDGIYIFDP 232
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   163 HGLREGSPAFVAKIRAGEVYTYLTYYTQEHPEARWAGAMVFFVPAGPGPPATAALTAAVLQLYGASETYLQ----DEPfv 238
Cdd:PHA03246  233 HGHGHIGQACIVRVSEGYFYQYLTSYADPSGMPDWSATFVYFVSTASSAPPKEEIISAVSRLYGTADIVLDlgraDEE-- 310
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   239 ERRVVVSHplrgdDVAPagavavgeeagrapaaarkappqtpppkaaapepagaadagvwgaalagaplalpapapsdsa 318
Cdd:PHA03246  311 DNLKVVSA-----DFDP--------------------------------------------------------------- 322
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   319 ggddaeddedgamevvSPLPRPNQ-HYPLGFSKRR-RPTWTPPSSLEDLSAgrhhpkraslptrtrrsarHAATPF--SR 394
Cdd:PHA03246  323 ----------------PSRPQPWQtKIVIGTADSYaDSSPKLHSESTDLTP-------------------HEHGEYdpST 367
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   395 GSGGDEQTRPAAGPRPPTPASRPPTPGAPPTPGAPPTPGAPPTPGAPPTPAGPTTASSEPPTPAG--PTTASSEPPTPAG 472
Cdd:PHA03246  368 LVGGASTNINISDPPARTDCRRYSEGSVIHESVDSHIEDVTEATSVVAAWSDAFSDISEDYSHLTrpDLPATAHDVSKNG 447
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   473 PTTASSEPPTPAG---RPPTPAGRPPTpANPTASSEPPT--PAGRPPTPAGRPPTPANPTASSEPPTPNPEGAPAPSS-- 545
Cdd:PHA03246  448 HDTKSDRRSRGSNsrhKRRRPSWTPPS-SSENVSSDGPTfsQSRKPSRKSKRALDLDYGHLSNEPSDVDGENSDSPAGai 526
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   546 ---------NEQPPAAASTDEATQKALdaLRDRQP--PEPPCGSLTELL--GRHPDTDGGVSRLAAHEAGIarEVTECS- 611
Cdd:PHA03246  527 snipdnvsfNEFISSQARAEDSIEHLS--LRNRPVfnPHTVTGNLDNTLrdSLWNDEYSGSYPLSDISDMI--DDITESi 602
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   612 ----RLTINaLRSPFPGSPGLLQHCIVFLFERVLAFLIENGARTHAGAGAEGPASgLLDLTVSLlPRRTAVGDFLASTRM 687
Cdd:PHA03246  603 adgmRLVVH-TSHRGDYDGGLLDVCIMDVFTRLFNYIIENGARTTTDRASVVEPE-MAALTKAF-TTPSHFSTFISSTGM 679
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   688 TLADVAAHLPLIQPVLDEGSIVGRLALAKLVLVARDVIRTTDDFHGELAELERRLRATPPTEVYARLSEWLLERSKAGPD 767
Cdd:PHA03246  680 SLSEASESADLIESVLTENSKIGKLALSKLTLVALEVEEATDTLHKSLDAIEREASTADPDRIYERMATVLVDTFYRNSG 759
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   768 TLFAPATPTHPE-PLLQRIQALAGFARREEVRAAAEDREVRGALDALARGVDAVARRSGPLTVAAVSPEEPGEGGGRPHP 846
Cdd:PHA03246  760 KLYSETTSYNSDqTLTDRVVSLCEFIRDRENVATRKAELILAEIEALDAGVRWMNTKFDAFIMGGSGGMSPAKGLSAIAK 839
                         890       900       910       920       930       940       950       960
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   847 LSPEAIRVRLEQLRADGQKAVEGATREYFHRGAVYSAKALLAGDARDRRYH-VASAPVVPVVQLLESLPAFDAHVQEVAR 925
Cdd:PHA03246  840 TSTDVVTQRLESIGKAAIDIVGNALREYYLKCALYSAKALRADMKETSRFKiVVTEQLEKMTRLLSSLSVIDDVILLIAS 919
                         970       980       990      1000      1010      1020      1030      1040
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   926 RARVPAPPPLATSPPAELLRELVQRGRDLEAPADLAAWLASLGDAAGQGLVVRKELDELAQAIYKINERTVRRSSGLAEL 1005
Cdd:PHA03246  920 RSDARVPVPVSKALESQLLGDLLEIGSHLDVPENLVTWKGLMVSVQTGGWISRRELDLLLKEVDAVNDNAARRETALTEL 999
                        1050      1060      1070      1080      1090      1100      1110      1120
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464  1006 ERFEALDAALRGELESEAAFEPGGGDgaaagglpaETRRLAEDALHQAKAMAAAKLTDELSPEARERLAARVRAIEAMLE 1085
Cdd:PHA03246 1000 ERLHELESRIASYTDLETTVDLQKLD---------EALKLANSIVKLTKGLDGAKLASSLSSDIREKIRQKRSETETLIA 1070
                        1130      1140      1150      1160      1170      1180      1190      1200
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464  1086 EARARAEAAKAALARFFQKLQGVLRPLPDFGGLRVAPAVLATLRADIPGGWTCLPDAAQAAPPEVRAALRADLWGLLGQY 1165
Cdd:PHA03246 1071 RLSARYAEVKAAVDGLYSSIRKLLRPLQNFAGLRALDSTVKTITDSIPPGMGSFESFLASAPPDVIGALRSDLWVLFTQY 1150
                        1210      1220      1230      1240      1250      1260      1270      1280
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464  1166 RGALEHPTADTAAALSGLHPNFAEVLRDLFPAAPETPLLVSFFSDHAPRVAQAVSEAIAAGSAAVATASPESTVEAAV-- 1243
Cdd:PHA03246 1151 KTILSRPSTGTAAELSGLGGPFALAIRVILGPEGEYPAASVFFGKHADVLSAALAAAASEPMSVEKTSAAVSVLKEAIsd 1230
                        1290      1300      1310      1320      1330      1340      1350      1360
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464  1244 --RAQGVLADT-VAALSPAVRDPAcpLAFLVAL---ADSAAGYVKATRLALGARRAIARLGALGAAAADLAVAVRRENPQ 1317
Cdd:PHA03246 1231 inRADATIPQQgIMADLSTERADA--FSFLHALlsdAEGAADVASRGAYLSTLIRTVRDSLEAVIDSITKIKSLDPKTYT 1308
                        1370      1380      1390      1400      1410      1420      1430      1440
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464  1318 ADGDRAAlleaaaraveaaragLAACEGEFGgllHAEGSAGDPSPSGRALQ---------------ELGKVVAATRRRAD 1382
Cdd:PHA03246 1309 YNTDASV---------------IVSAKAEVS---AATQQAGDCKDALTALEnepltynphiqrkiaELNKLIESVNQRVG 1370
                        1450      1460      1470      1480      1490      1500      1510      1520
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464  1383 ELEAAAADLAEKLADRDARAGRERWAADVEAALDRVENRAEFDAVELRRLQALAAQNKYNPRDFRKRAEQALAANAKTAT 1462
Cdd:PHA03246 1371 ELDIALQTYERNRISAERSRSEDLWTSSITSLLVNAEIKSEFDAMEINSLEETARNAGYDTIRFKSRAEKIVSAHARVVE 1450
                        1530      1540      1550      1560      1570      1580      1590      1600
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464  1463 LALEAAFAFNPYTPENQHHPALPPLAAARRIDWGPAFGAAAETYAEMFRVDTEPLARLLRITGGLLDLAQAGGGFIDYHE 1542
Cdd:PHA03246 1451 NAIETVLKFNPYSTTNIIHGLKPPIAALKNITWGDAFFAAAPYYTKLFGVNCDVLISLLKILFAILRHASAHPGNLDYYF 1530
                        1610      1620      1630      1640      1650      1660      1670      1680
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464  1543 AVSRLAEDLNGVPSLRHYVPFFRRGHAEYLELCDRLDALRADVHRALGGVPLDLAAAAEQTVRLRgDPAAAAELVRTGVT 1622
Cdd:PHA03246 1531 LVGEIESDLRAIPSLAKYVDFYRRGHDSFEGFLARLEQMRVEALHASGRVSIEISDALETLARTH-SPEGARRALEYGVS 1609
                        1690      1700      1710      1720      1730      1740      1750      1760
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464  1623 LACPSEDALAACVGALERVDQAPVKDTAYAEHVAFVARRDLGEAKDALVRAKQQRAEATDRVTAALREALAAHERQARSE 1702
Cdd:PHA03246 1610 IVIPSANTIMSIADALQKEKITELDGTAYAEYSAHILRRDNDAIKSITQRVTTAIEAAKSRGESILKDLAEASYAADRET 1689
                        1770      1780      1790      1800      1810      1820      1830      1840
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464  1703 AESLANLKTLLRVAAIPAAAAKTLEQARSVAEIVDQIELLLEQTEKAAELDQAAVDWLEHARRVFEAHPLTAARDGSpDP 1782
Cdd:PHA03246 1690 AEQLANLKNLLRLVAMPAHIAKAIDKAETANDIVTQAALLLTKVEETKELDTQTVEWLKHAESVIDSHDLTVRIDES-GP 1768
                        1850      1860      1870      1880      1890      1900      1910      1920
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464  1783 LARLHARLDALGETRRRTAALRRSLEAAEAEWDEVWARFGRARGGAWKSPEALGAAREQLRALQTATNTVLGLVADAHYP 1862
Cdd:PHA03246 1769 MTIYAERIDALVRLENRLAELKSELALAEVAWDDTWSTFIHDKDRIDKSSEGFSAARESAARAKVSTNAINALRNNAEYN 1848
                        1930      1940      1950      1960      1970      1980      1990      2000
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464  1863 RLPAKYQGAIGAKSAERAGAVEELGVAVERHDGLLARLrEDVVARVPWEMNADALGRLLAEFDALAEDLTPWAVDEFRGA 1942
Cdd:PHA03246 1849 RLPAKIIGLIDSKYRDRTVVLDAFLDSMKEIEDTQKQM-EILCSKIPLTFSLNDLRAISSQFDDIAKRLPKWYVKQVGRY 1927
                        2010      2020      2030      2040      2050      2060      2070      2080
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464  1943 RALVQHRLGLYSAYAKAraqtgaGGTPPPAPAPLLVDVRALEArarspGERHEPDPRTV----RGRGEAYLRARgdpGPL 2018
Cdd:PHA03246 1928 SRLIKLRLALYAAYSNA------STGPIGDPPLLPFDTGRANI-----AANMAPSVGLVdrylKHRVAAWIRPK---VVA 1993
                        2090      2100      2110      2120      2130      2140      2150      2160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464  2019 ELREATSDLDLPFATSYLTPDGTPLQYVVCFPAVTDKLGALLMRAEAARARPPLPPEGLDSTQTLAAMCTVPLITQLQLA 2098
Cdd:PHA03246 1994 TLQEAFSEIDMPGLTTYLDSTDTPLRYSICYRTVGEKLAACLCEPSAVNIKPRIPTNLISTVEMESAAGMLNDIMILRLG 2073
                        2170      2180      2190      2200      2210      2220      2230      2240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464  2099 LSDAQGDGFRLFGRFVRHRQPGW--RDSAAAAAELYAALTATTLTREFGCRWDELGWERGAPAPAPLA------EPAGTR 2170
Cdd:PHA03246 2074 FVQACADRFQLFTKYVRTGIHDWspDYCIKAHGTIYCALIAITLTRKTGSNLSDIYFIPGQYGPTIDKkelkkfAANGRR 2153
                        2250      2260      2270      2280      2290      2300      2310      2320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464  2171 RPRVTFNMNDVMVALVAGTPEHIYNFWRLDLVLQHEYMHLTLPAAWETGAGAILFVQRLTPH------PSPEVRVLPAVA 2244
Cdd:PHA03246 2154 RQVVRLDPADVMVTIMACTPGHVLTFSKLDLINQYDFMDKTLYNVLTDSISTVAFVNCLSTQlskdgkVDPNCRPLSLTG 2233
                        2330      2340      2350      2360      2370      2380      2390      2400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464  2245 AGPPPATGLLFGTRLADWRHGKLSSSDPLAPWRAApELAAGGAAAALGGLGGPRALVAVSVLGRMCLPSAALAALWSCMF 2324
Cdd:PHA03246 2234 GEWDPSGGSLFAIRYSDWKQGRLSDTDPLKPWEDI-SGDEGAGLAKIRAVVPSSLLTTTTVLARMCIPPTALAVMWSSLL 2312
                        2410      2420      2430      2440      2450      2460      2470      2480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464  2325 PEGYSEY-DSLDALLAARLGAGRPPDPQGGREDVPA-------PPAHALYRPSGQRV--LVRGGAPDPAARVTVMDLVLA 2394
Cdd:PHA03246 2313 PDGIEQTcKSYDDVVTARGDGASTLDVTTSTIDHTGvkniqtnLDCPNLYEYTGTATtfTVVSTPPSRVLKVNAMDIAAC 2392
                        2490      2500      2510      2520      2530      2540      2550      2560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464  2395 ATLLGAPVVVALRSDPAFSKGSELELCVTLFDSRErGADAALRE--VVSSDVETWATDLLHADLNAIENACLAAQLPALS 2472
Cdd:PHA03246 2393 ATLFGARIVIAAECPEAYSSDSGLSLCIRLFDSRS-GSKGCFLEpgAVSSDITSWGAKLLAADSNPIENACLGQQLEHLS 2471
                        2570      2580      2590      2600      2610      2620
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 30984464  2473 ALIAARPLAGSPPCLVLVDISMAPLFVLWEQPDPPGPPDVRFVGSDE--IEELPFVSPGADVLAGLAAE 2539
Cdd:PHA03246 2472 HLIASKPLSSAPPCLVIVDAGMVPVKVLWAKEELDPIPIIRLTSADDalISELPYIDAGIRGETGADFQ 2540
Herpes_UL36 pfam03586
Herpesvirus UL36 tegument protein; The UL36 open reading frame (ORF) encodes the largest herpes simplex virus type 1 (HSV-1) protein, a 270-kDa polypeptide designated VP1/2, which is also a component of the virion tegument. A null mutation in the UL36 gene of herpes simplex virus type 1 results in accumulation of unenveloped DNA-filled capsids in the cytoplasm of infected cells. This family only covers a small central part of this large protein.
1405-1654 1.55e-92

Pssm-ID: 427378 [Multi-domain]  Cd Length: 251  Bit Score: 301.01  E-value: 1.55e-92
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   1405 ERWAADVEAALDRVENRAEFDAVELRRLQALAAQNKYNPRDFRKRAEQALAANAKTATLALEAAFAFNPYTPENQHhpAL 1484
Cdd:pfam03586    3 ERWRSDIRALLERDEDDGEFDLDELDRLRDEAETGGYDAVDLIKRARQVVDARAQLGLRALETVFAFNPYTPQNSQ--AL 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   1485 PPLAAARRIDWGPAFGAAAETYAEMFRVDTEPLARLLRITGGLLDLAQAG-GGFIDYHEAVSRLAEDLNGVPSLRHYVPF 1563
Cdd:pfam03586   81 PPLALLESITWIDAFPGAADTYTYLFGVSVEKLKALLRIGEEILEAADAAnDGNIDYHAFVLTLSGDLFQVPALTEYVDF 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   1564 FRRGHAEYLELCDRLDALRADVHRALGGVPLDLAAAAEQTVRLRGDPAAAAELVRTGVTLACPSEDALAACVGALERVDQ 1643
Cdd:pfam03586  161 YVRSYELFLDIRAALAELRADARGALGRVALEQLAALEEAAEVRRDPEAAKEALERGVRITLPSEDALTAMREGLKLEDK 240
                          250
                   ....*....|.
gi 30984464   1644 APVKDTAYAEH 1654
Cdd:pfam03586  241 KQFEGTAYLEY 251
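Each hit above is introduced by a one-line summary (Pssm-ID, Cd Length, Bit Score, E-value). The following is a minimal sketch of how such a line could be read into a record; the function name `parse_summary` and the dictionary keys are illustrative assumptions, not part of any NCBI tool, and the pattern only covers the layout seen on this page.

```python
import re

# Pattern for the per-hit summary line as it appears on this page.
# The field names below are hypothetical labels chosen for this sketch.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*"
    r"(?:\[(?P<hit_type>[^\]]+)\])?\s*"
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s*"
    r"E-value:\s*(?P<e_value>\S+)"
)


def parse_summary(line):
    """Return the summary fields as a dict, or None if the line does not match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "hit_type": m.group("hit_type"),          # e.g. "Multi-domain"
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),     # "0e+00" parses to 0.0
    }


example = ("Pssm-ID: 427378 [Multi-domain]  Cd Length: 251  "
           "Bit Score: 301.01  E-value: 1.55e-92")
hit = parse_summary(example)
```

An E-value printed as 0e+00 becomes 0.0 here, which reflects the rounding of the display rather than a true zero probability.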
Herpes_teg_N pfam04843
Herpesvirus tegument protein, N-terminal conserved region;
15-185 2.06e-56

Pssm-ID: 461453 [Multi-domain]  Cd Length: 183  Bit Score: 194.64  E-value: 2.06e-56
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464     15 RNQFAPDLEPGGAVSCMRSSLSFLSLVFDAGLRDALSAEAVDGCLVEGGAW--TRASAGSDPPR-MCSAVELPTFLEYPS 91
Cdd:pfam04843    2 RNQFDCKFGPRAGSQCLSNCVSFLHSSYLNGERPVLSREALDAVLEEGARLdaLLRTSGRLPPRqYAQLHEIPGVIITGA 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464     92 GRGLRCVFSRLYGEVA------FFGKPAPGLLETQCPAHDFFAGPWARRPLSYTLVTIGALGMGLYRDGDEAYLFDPHGL 165
Cdd:pfam04843   82 WGCLIYRSSEIYGLVGhelsrnFNGTPQTGLLDTQCPAGTFFAYPWAKRPPSYTLIICNSLAGAIVIKDDTYYLFDPHCT 161
                          170       180
                   ....*....|....*....|..
gi 30984464    166 REGS--PAFVAKIRAGEVYTYL 185
Cdd:pfam04843  162 PEGNstAAVIVTVDAGDVYPYL 183
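The paired rows above (query on the `gi` line, domain model on the `Cdd:` line) can be compared column by column. The sketch below estimates percent identity for one such pair. It assumes that `-` marks a gap and that lowercase letters mark unaligned residues, so both are skipped; the function name `percent_identity` is a hypothetical helper, not an NCBI utility.

```python
def percent_identity(query_row, subject_row):
    """Percent of aligned columns in which both rows carry the same residue.

    Assumption for this sketch: columns holding a gap ('-') or a lowercase
    (unaligned) residue in either row are not counted as aligned.
    """
    if len(query_row) != len(subject_row):
        raise ValueError("aligned rows must have equal length")
    aligned = 0
    identical = 0
    for q, s in zip(query_row, subject_row):
        if q == "-" or s == "-" or q.islower() or s.islower():
            continue  # skip gaps and unaligned residues
        aligned += 1
        if q == s:
            identical += 1
    return 100.0 * identical / aligned if aligned else 0.0


# Last alignment block of the pfam04843 hit, copied from the rows above.
query = "REGS--PAFVAKIRAGEVYTYL"
subject = "PEGNstAAVIVTVDAGDVYPYL"
print(round(percent_identity(query, subject), 1))
```

Applied block by block and summed over a whole hit, the same counts give a rough identity figure to set beside the reported bit score.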
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
455-608 6.62e-13

Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 74.95  E-value: 6.62e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464    455 PTPAGPT--TASSEPPTPAG------PTTASSEP---------------------PTPAGRPPTPAGRPPTP-------- 497
Cdd:pfam05109  461 PASTGPTvsTADVTSPTPAGttsgasPVTPSPSPrdngteskapdmtsptsavttPTPNATSPTPAVTTPTPnatsptlg 540
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464    498 -ANPTASSEPPTPAGRPPTPAGRPPTP---------ANPTASSEPPTPNpegAPAPSSNEQPPAAASTDEATQKALDALR 567
Cdd:pfam05109  541 kTSPTSAVTTPTPNATSPTPAVTTPTPnatiptlgkTSPTSAVTTPTPN---ATSPTVGETSPQANTTNHTLGGTSSTPV 617
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|.
gi 30984464    568 DRQPPEPPCGSLTEllGRHPDTDGGVSRLAAHEAGIAREVT 608
Cdd:pfam05109  618 VTSPPKNATSAVTT--GQHNITSSSTSSMSLRPSSISETLS 656
PHA03269 PHA03269
envelope glycoprotein C; Provisional
455-574 8.61e-13

Pssm-ID: 165527 [Multi-domain]  Cd Length: 566  Bit Score: 74.38  E-value: 8.61e-13
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   455 PTPAGPTTASSEPPTPA-GPTTASSEPPTPAGRPPTPAGRPPTPAN-PT-ASSEPPTPAGRPPTPAGRPPTPANPTASSE 531
Cdd:PHA03269   27 PIPELHTSAATQKPDPApAPHQAASRAPDPAVAPTSAASRKPDLAQaPTpAASEKFDPAPAPHQAASRAPDPAVAPQLAA 106
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....
gi 30984464   532 PPTPNPEGAPAPSSNEQP-PAAASTDEATQKALDALRDRQPPEP 574
Cdd:PHA03269  107 APKPDAAEAFTSAAQAHEaPADAGTSAASKKPDPAAHTQHSPPP 150
PksD COG3321
Acyl transferase domain in polyketide synthase (PKS) enzymes [Secondary metabolites biosynthesis, transport and catabolism];
1474-2013 5.02e-11

Pssm-ID: 442550 [Multi-domain]  Cd Length: 1386  Bit Score: 69.13  E-value: 5.02e-11
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464 1474 YTPENQHHPALPPLAAARRIDWGPAFGAAAEtyaemfrvdTEPLARLLRITGGLLDLAQAGGGFIDYHEAVSRLAEDLNG 1553
Cdd:COG3321  853 YPGRGRRRVPLPTYPFQREDAAAALLAAALA---------AALAAAAALGALLLAALAAALAAALLALAAAAAAALALAA 923
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464 1554 VPSLRHYVPFFRRGHAEYLELCDRLDALRADVHRALGGVPLDLAAAAEQTVRLRGDPAAAAELVRTGVTLACPSEDALAA 1633
Cdd:COG3321  924 AALAALLALVALAAAAAALLALAAAAAAAAAALAAAEAGALLLLAAAAAAAAAAAAAAAAAAAAAAAAAAAALAAAAALA 1003
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464 1634 CVGALERVDQAPVKDTAYAEHVAFVARRDLGEAKDALVRAKQQRAEATDRVTAALREALAAHERQARSEAESLANLKTLL 1713
Cdd:COG3321 1004 LLAAAALLLAAAAAAAALLALAALLAAAAAALAAAAAAAAAAAALAALAAAAAAAAALALALAALLLLAALAELALAAAA 1083
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464 1714 RVAAIPAAAAkTLEQARSVAEIVDQIELLLEQTEKAAELDQAAVDWLEHARRVFEAHPLTAARDGSPDPLARLHARLDAL 1793
Cdd:COG3321 1084 LALAAALAAA-ALALALAALAAALLLLALLAALALAAAAAALLALAALLAAAAAAAALAAAAAAAAALALAAAAAALAAA 1162
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464 1794 GETRRRTAALRRSLEAAEAEWDEVWARFGRARGGAWKSPEALGAAREQLRALQTATNTVLGLVADAHYPRLPAKYQGAIG 1873
Cdd:COG3321 1163 LAAALLAAAALLLALALALAAALAAALAGLAALLLAALLAALLAALLALALAALAAAAAALLAAAAAAAALALLALAAAA 1242
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464 1874 AKSAERAGAVEELGVAVERHDGLLARLREDVVARVPWEMNADALGRLLAEFDALAEDLTPWAVDEFRGARALVQHRLGLY 1953
Cdd:COG3321 1243 AAVAALAAAAAALLAALAALALLAAAAGLAALAAAAAAAAAALALAAAAAAAAAALAALLAAAAAAAAAAAAAAAAAALA 1322
                        490       500       510       520       530       540
                 ....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464 1954 SAYAKARAQTGAGGTPPPAPAPLLVDVRALEARARSPGERHEPDPRTVRGRGEAYLRARG 2013
Cdd:COG3321 1323 AALLAAALAALAAAVAAALALAAAAAAAAAAAAAAAAAAALAAAAGAAAAAAALALAALA 1382
PHA03269 PHA03269
envelope glycoprotein C; Provisional
455-563 5.35e-11

Pssm-ID: 165527 [Multi-domain]  Cd Length: 566  Bit Score: 68.22  E-value: 5.35e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   455 PTPAgPTTASSEPPTPA-GPTTASSEPPTPAGRPPTPAGRPPTPA-NPT-ASSEPPTPAGRPPTPAGRPPTPANPTASSE 531
Cdd:PHA03269   42 PAPA-PHQAASRAPDPAvAPTSAASRKPDLAQAPTPAASEKFDPApAPHqAASRAPDPAVAPQLAAAPKPDAAEAFTSAA 120
                          90       100       110
                  ....*....|....*....|....*....|..
gi 30984464   532 PPTPNPEGAPAPSSNEQPPAAASTDEATQKAL 563
Cdd:PHA03269  121 QAHEAPADAGTSAASKKPDPAAHTQHSPPPFA 152
PHA03269 PHA03269
envelope glycoprotein C; Provisional
451-534 7.89e-10

Pssm-ID: 165527 [Multi-domain]  Cd Length: 566  Bit Score: 64.75  E-value: 7.89e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   451 SSEPPTPA-GPTTASSEPPTPA-GPTTASSEPPTPAGRPPTPAGRPPTPANP--TASSEPPTPAGRPPTPAGRPPTPANP 526
Cdd:PHA03269   64 ASRKPDLAqAPTPAASEKFDPApAPHQAASRAPDPAVAPQLAAAPKPDAAEAftSAAQAHEAPADAGTSAASKKPDPAAH 143

                  ....*...
gi 30984464   527 TASSEPPT 534
Cdd:PHA03269  144 TQHSPPPF 151
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
1398-1962 1.45e-09

Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 64.17  E-value: 1.45e-09
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464 1398 RDARAGRERWAADVEAALDRVEN-RAEFDAVELRRLQALAAQNKYNPRDFRKRAEQALAANAKTATLAL-----EAAFAf 1471
Cdd:COG4913  305 ARLEAELERLEARLDALREELDElEAQIRGNGGDRLEQLEREIERLERELEERERRRARLEALLAALGLplpasAEEFA- 383
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464 1472 npytpENQHHpalpplAAARRIDWGPAFGAAAETYAEMfRVDTEPLARLLRITGGLLDLAQAGGGFID--YHEAVSRLAE 1549
Cdd:COG4913  384 -----ALRAE------AAALLEALEEELEALEEALAEA-EAALRDLRRELRELEAEIASLERRKSNIParLLALRDALAE 451
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464 1550 DLnGVPSLRhyVPFFrrghAEYLELCDRLDALRADVHRALGGVPLDL-------AAAAE------QTVRLRGDpaaAAEL 1616
Cdd:COG4913  452 AL-GLDEAE--LPFV----GELIEVRPEEERWRGAIERVLGGFALTLlvppehyAAALRwvnrlhLRGRLVYE---RVRT 521
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464 1617 VRTGVTLACPSEDALAacvgalERVDqapVKDTAYAEHV-AFVARR----------DLGEAKDALVRAKQQRAEAT---- 1681
Cdd:COG4913  522 GLPDPERPRLDPDSLA------GKLD---FKPHPFRAWLeAELGRRfdyvcvdspeELRRHPRAITRAGQVKGNGTrhek 592
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464 1682 ---DRVTAAL------REALAAHERQARSEAESLANLKtllrvaaipaaaaktlEQARSVAEIVDQIELLLEQTEKAAEL 1752
Cdd:COG4913  593 ddrRRIRSRYvlgfdnRAKLAALEAELAELEEELAEAE----------------ERLEALEAELDALQERREALQRLAEY 656
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464 1753 DQAAVDWLEHARRvfeahpltaardgspdpLARLHARLDALGETRRRTAALRRSLEAAEAEWDEVWARFGRARGGAWKSP 1832
Cdd:COG4913  657 SWDEIDVASAERE-----------------IAELEAELERLDASSDDLAALEEQLEELEAELEELEEELDELKGEIGRLE 719
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464 1833 EALGAAREQLRALQTATNTVLGLVADAHYPRLPAKYQGAIGAKSAERagAVEELGVAVERHDGLLARLREDVV------- 1905
Cdd:COG4913  720 KELEQAEEELDELQDRLEAAEDLARLELRALLEERFAAALGDAVERE--LRENLEERIDALRARLNRAEEELEramrafn 797
                        570       580       590       600       610       620
                 ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 30984464 1906 -----ARVPWEMNADALGRLLAEFDALAEDLTPWAVDEFRGA--RALVQHRLGLYSAYAKARAQ 1962
Cdd:COG4913  798 rewpaETADLDADLESLPEYLALLDRLEEDGLPEYEERFKELlnENSIEFVADLLSKLRRAIRE 861
PHA03269 PHA03269
envelope glycoprotein C; Provisional
451-552 2.32e-09

envelope glycoprotein C; Provisional


Pssm-ID: 165527 [Multi-domain]  Cd Length: 566  Bit Score: 63.21  E-value: 2.32e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   451 SSEPPTPA-GPTTASSEPPTPA-GPTTASSEPPTPAGRPPTPAGRPPTPANPTASSEPPTP-AGRPPTPAG-RPPTPANP 526
Cdd:PHA03269   50 ASRAPDPAvAPTSAASRKPDLAqAPTPAASEKFDPAPAPHQAASRAPDPAVAPQLAAAPKPdAAEAFTSAAqAHEAPADA 129
                          90       100
                  ....*....|....*....|....*.
gi 30984464   527 TASSEPPTPNPegaPAPSSNEQPPAA 552
Cdd:PHA03269  130 GTSAASKKPDP---AAHTQHSPPPFA 152
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
455-614 2.35e-09

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 63.19  E-value: 2.35e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   455 PTPAGPTTASSEPPTPAGPTTAssePPTPAGRPPTPAGRPPTPANPTASSEPPTP-AGRPPTPAGRPPTPANPTASSEPP 533
Cdd:PRK14951  366 PAAAAEAAAPAEKKTPARPEAA---APAAAPVAQAAAAPAPAAAPAAAASAPAAPpAAAPPAPVAAPAAAAPAAAPAAAP 442
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   534 TPNPEGAPAPSSNEQPPAAASTDEATQKALDALRDRQPPEPPCGSL--TELlgrHPDTDGGVSRLAAHEA--GIAREVTE 609
Cdd:PRK14951  443 AAVALAPAPPAQAAPETVAIPVRVAPEPAVASAAPAPAAAPAAARLtpTEE---GDVWHATVQQLAAAEAitALARELAL 519

                  ....*
gi 30984464   610 CSRLT 614
Cdd:PRK14951  520 QSELV 524
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
468-553 1.18e-08

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 61.44  E-value: 1.18e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   468 PTPAGPTTASSEPPTPAGRPPTPAGRPPTPANPTASSEPPTPAGRPPTPAGRPPTPANPTASSEPPTPNPEGAPAPSSNE 547
Cdd:PRK12270   38 PGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAAPPAAAAAAAPAAAAVEDEV 117

                  ....*....
gi 30984464   548 QP---PAAA 553
Cdd:PRK12270  118 TPlrgAAAA 126
PHA03378 PHA03378
EBNA-3B; Provisional
455-575 1.30e-08

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 61.24  E-value: 1.30e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   455 PTPAGPTT--------ASSEPPtPAGPTTAS--SEPPTPAGRPPTPAGRPPTPANPTASSEPPTPAGRPPTPAGRPPTPA 524
Cdd:PHA03378  676 PSPTGANTmlpiqwapGTMQPP-PRAPTPMRppAAPPGRAQRPAAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRA 754
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|.
gi 30984464   525 NPTASSEPPTPNPEGAPAPSSNEQPPAAASTdeATQKALDALRDRQPPEPP 575
Cdd:PHA03378  755 RPPAAAPGRARPPAAAPGAPTPQPPPQAPPA--PQQRPRGAPTPQPPPQAG 803
PHA03269 PHA03269
envelope glycoprotein C; Provisional
488-594 2.12e-08

envelope glycoprotein C; Provisional


Pssm-ID: 165527 [Multi-domain]  Cd Length: 566  Bit Score: 60.13  E-value: 2.12e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   488 PTPAgrPPTPANPTASSEPPTPAGRPPTPAGRPPTPANPTASSEPPTPNPEGAPAPSSNEQP-PAAASTDEATQK----- 561
Cdd:PHA03269   23 NTNI--PIPELHTSAATQKPDPAPAPHQAASRAPDPAVAPTSAASRKPDLAQAPTPAASEKFdPAPAPHQAASRApdpav 100
                          90       100       110
                  ....*....|....*....|....*....|....*.
gi 30984464   562 --ALDALRDRQPPEPPCGSLTellgRHPDT-DGGVS 594
Cdd:PHA03269  101 apQLAAAPKPDAAEAFTSAAQ----AHEAPaDAGTS 132
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
451-540 2.37e-08

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 59.82  E-value: 2.37e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   451 SSEPPTPAGPTTASSEPPTPAGPTTASSEPPTPAGRPPtpagrPPTPANPTASSEPPTPAGRPPTPAGRPPTPANPTASS 530
Cdd:PRK14950  372 TAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATP-----PPVPPRPVAPPVPHTPESAPKLTRAAIPVDEKPKYTP 446
                          90
                  ....*....|
gi 30984464   531 EPPTPNPEGA 540
Cdd:PRK14950  447 PAPPKEEEKA 456
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
366-573 2.55e-08

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 60.00  E-value: 2.55e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   366 SAGRHHPKRASLPTRTRRSARHAATPFSRGSGGDEQTRPAAGPRPPTPASrpptPGAPPTPGAPPTPGAPPTPGAPPTPA 445
Cdd:PRK07764  591 APGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEAS----AAPAPGVAAPEHHPKHVAVPDASDGG 666
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   446 GPTTASSEPPTPAGPTTASSEPPTPAGPTTASSEP-PTPAGRPPTPAGRPPTPANPTASSEPPTPAGRPPTPAGRPPTPA 524
Cdd:PRK07764  667 DGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPaPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPD 746
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|.
gi 30984464   525 NPTASSEPP--TPNPEGAPAPSSNEQPPAAASTDEATQKALDALRDRQPPE 573
Cdd:PRK07764  747 DPPDPAGAPaqPPPPPAPAPAAAPAAAPPPSPPSEEEEMAEDDAPSMDDED 797
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
487-575 3.36e-08

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 59.44  E-value: 3.36e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   487 PPTPAGRPPTPANPTASSEPPTPAGRPPTPAGRPPTPANPTASSEPPTPNPEGAPAPSSNEQPPAAASTDEATQKALDAL 566
Cdd:PRK14950  363 VPAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRPVAPPVPHTPESAPKLTRAAIPVDEKP 442

                  ....*....
gi 30984464   567 RDRQPPEPP 575
Cdd:PRK14950  443 KYTPPAPPK 451
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
386-602 3.55e-08

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 59.50  E-value: 3.55e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   386 RHAATPFSRGSGGDEQTRPAAGPRPPTPASRPPTPGAPPTPGAPPTPGAPPTPGAPPTPAGPTTASSEPPTPAGPTTA-- 463
Cdd:PRK12323  359 RMLAFRPGQSGGGAGPATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAArq 438
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   464 --SSEPPTPAGPTTASSEPPTPAGRPPTPAGRPPTPANPTASSEPPTPAGRPPTPAGRPPTPANPTASSEPPTPNPEGAP 541
Cdd:PRK12323  439 asARGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAP 518
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 30984464   542 APSSNEQPPAAASTDEATQKALDALRDRQPPEPPCGSLTELLGRHPDTDGGVSRLAAHEAG 602
Cdd:PRK12323  519 AGWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRASASGLPDMFDG 579
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
349-568 3.93e-08

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 59.23  E-value: 3.93e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   349 SKRRRPTWTPPSSLEDLSAGRHHPKRASLPTRTRRSARHAATPFSRGSGGDEQTRPAAGPRPPTPASRPPTPGAPPTPGA 428
Cdd:PRK07764  592 PGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPA 671
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   429 PPTPGAPPTPGAPPTPAGPTTASSEPPTPAGPTTASSEPPTPAGPTTASSEPPTPAGRPPTPAGRPPTPANPTASSEPPT 508
Cdd:PRK07764  672 KAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDP 751
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   509 PAGRPPTPAGRPPTPANPTASSEPPTPNPEGAPAPSSNEQPPAAASTDEATQKALDALRD 568
Cdd:PRK07764  752 AGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEMAEDDAPSMDDEDRRDAEEVAMELLEE 811
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
454-571 3.99e-08

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 59.23  E-value: 3.99e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   454 PPTPAGPTTASSEPPTPAGPTTASSEPPT---PAGRPPTPAGRPPTPANPTASSEPPTPAGRPPTPAGRP-PTPANPTAS 529
Cdd:PRK07764  399 PSAAAAAPAAAPAPAAAAPAAAAAPAPAAapqPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPaPAAAPEPTA 478
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|..
gi 30984464   530 SEPPTPNPEGAPAPSsnEQPPAAASTDEATQkALDALRDRQP 571
Cdd:PRK07764  479 APAPAPPAAPAPAAA--PAAPAAPAAPAGAD-DAATLRERWP 517
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
384-575 1.42e-07

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 57.69  E-value: 1.42e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   384 SARHAATPFSRGSGGDEQTRPAAGPRPPTPASRPPTPGAPPTPGAPPTPGAPPTPGAPPTPAGPTTASSEPPTPAGPTTA 463
Cdd:PRK07764  594 AAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKA 673
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   464 SSEPPTPAGPTTASSEPPTPAGRPPTPAGRPPTPANPTASSEPPTPAGRPPTPAGRPPTPANPTASSEPPTPNPEGAPAP 543
Cdd:PRK07764  674 GGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAG 753
                         170       180       190
                  ....*....|....*....|....*....|..
gi 30984464   544 SSNEQPPAAASTDEATQKALDALRDRQPPEPP 575
Cdd:PRK07764  754 APAQPPPPPAPAPAAAPAAAPPPSPPSEEEEM 785
PksD COG3321
Acyl transferase domain in polyketide synthase (PKS) enzymes [Secondary metabolites ...
1187-1828 1.61e-07

Acyl transferase domain in polyketide synthase (PKS) enzymes [Secondary metabolites biosynthesis, transport and catabolism];


Pssm-ID: 442550 [Multi-domain]  Cd Length: 1386  Bit Score: 57.57  E-value: 1.61e-07
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464 1187 FAEVLRDLFPAAPETPLLVSFFSDHAP-----------------RVAQAVSEAIAAGSAAVATASPESTVEAAVRAQGVL 1249
Cdd:COG3321  734 FRAALAGVTPRAPRIPLISNVTGTWLTgealdadywvrhlrqpvRFADAVEALLADGVRVFLEVGPGPVLTGLVRQCLAA 813
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464 1250 ADTVAALSPAVRDPACPLAFLVALADSAAGYVKATRLALGARRAIARLGALGAAAADLAVAVRRENPQADGDRAALLEAA 1329
Cdd:COG3321  814 AGDAVVLPSLRRGEDELAQLLTALAQLWVAGVPVDWSALYPGRGRRRVPLPTYPFQREDAAAALLAAALAAALAAAAALG 893
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464 1330 ARAVEAARAGLAACEGEFGGLLHAEGSAGDPSPSGRALQELGKVVAATRRRADELEAAAADLAEKLADRDARAGRERWAA 1409
Cdd:COG3321  894 ALLLAALAAALAAALLALAAAAAAALALAAAALAALLALVALAAAAAALLALAAAAAAAAAALAAAEAGALLLLAAAAAA 973
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464 1410 DVEAALDRVENRAEFDAVELRRLQALAAQNKYNPRDFRKRAEQALAANAKTATLALEAAFAFNPYTPENQHHPALPPLAA 1489
Cdd:COG3321  974 AAAAAAAAAAAAAAAAAAAAAALAAAAALALLAAAALLLAAAAAAAALLALAALLAAAAAALAAAAAAAAAAAALAALAA 1053
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464 1490 ARRIDWGPAFGAAAETYAEMFRVDTEPLARLLRITGGLLDLAQAGGGFIDYHEAVSRLAEDLNGVPSLRHYVPFFRRGHA 1569
Cdd:COG3321 1054 AAAAAAALALALAALLLLAALAELALAAAALALAAALAAAALALALAALAAALLLLALLAALALAAAAAALLALAALLAA 1133
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464 1570 EylelcdRLDALRADVHRALGGVPLDLAAAAEQTVRLRGDPAAAAELVRTGVTLACPSEDALAACVGALERVDQAPVKDT 1649
Cdd:COG3321 1134 A------AAAAALAAAAAAAAALALAAAAAALAAALAAALLAAAALLLALALALAAALAAALAGLAALLLAALLAALLAA 1207
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464 1650 AYAEHVAFVARRDLGEAKDALVRAKQQRAEATDRVTAALREALAAHERQARSEAESLANLKTLLRVAAIPAAAAKTLEQA 1729
Cdd:COG3321 1208 LLALALAALAAAAAALLAAAAAAAALALLALAAAAAAVAALAAAAAALLAALAALALLAAAAGLAALAAAAAAAAAALAL 1287
                        570       580       590       600       610       620       630       640
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464 1730 RSVAEIVDQIELLLEQTEKAAELDQAAVDWLEHARRVFEAHPLTAARDGSPDPLARLHARLDALGETRRRTAALRRSLEA 1809
Cdd:COG3321 1288 AAAAAAAAAALAALLAAAAAAAAAAAAAAAAAALAAALLAAALAALAAAVAAALALAAAAAAAAAAAAAAAAAAALAAAA 1367
                        650
                 ....*....|....*....
gi 30984464 1810 AEAEWDEVWARFGRARGGA 1828
Cdd:COG3321 1368 GAAAAAAALALAALAAAVA 1386
PHA03378 PHA03378
EBNA-3B; Provisional
454-566 1.65e-07

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 57.38  E-value: 1.65e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   454 PPTPAGPTTASSEP-PTPAGPTTASSEPPTPAGRPPTPAGRPPTPANPTASSEP-PTPAGRPPTPAGRPPTPANPTASSE 531
Cdd:PHA03378  710 PPGRAQRPAAATGRaRPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGRaRPPAAAPGAPTPQPPPQAPPAPQQR 789
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|
gi 30984464   532 P-----PTPNPEGAPAPSSnEQPPAAASTDEATQKALDAL 566
Cdd:PHA03378  790 PrgaptPQPPPQAGPTSMQ-LMPRAAPGQQGPTKQILRQL 828
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
385-572 1.81e-07

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 57.19  E-value: 1.81e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   385 ARHAATPFSRGSGGDEQTRPAAGPRPPTPASRPPTPGAPPTPGAPPTPGAPPTPGAPPTPAGPTTASSEPPTPAGPTTAS 464
Cdd:PRK12323  375 ATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAPAPAPAP 454
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   465 SEPPTPAGPTTASSEPPTPAGRPPTPAGRPPTPANPTASSEPPTPAGRPPTPAGRPPTPANPT------------ASSEP 532
Cdd:PRK12323  455 AAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAApagwvaesipdpATADP 534
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|.
gi 30984464   533 PTPNPEGAPAPSSNEQPPAAASTDE-ATQKALDALRDRQPP 572
Cdd:PRK12323  535 DDAFETLAPAPAAAPAPRAAAATEPvVAPRPPRASASGLPD 575
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
1659-1988 2.56e-07

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 56.87  E-value: 2.56e-07
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464 1659 ARRDLGEAKDALVRAKQQRAEATDRVTAALREALAAHERQARSEAESLANLKTLLRVAAIPAAAAKTL-----EQARSVA 1733
Cdd:COG1196  377 AEEELEELAEELLEALRAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALeeaaeEEAELEE 456
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464 1734 EIVDQIELLLEQTEKAAELDQAAVDWLEHARRVFEAHPLTAARDGSPDPLARLHARLDALGETRRRTAALrrsleAAEAE 1813
Cdd:COG1196  457 EEEALLELLAELLEEAALLEAALAELLEELAEAAARLLLLLEAEADYEGFLEGVKAALLLAGLRGLAGAV-----AVLIG 531
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464 1814 WDEVWARFGRARGGAWKSPEALGAAREQLRALQTATNTVLGLVADAHYPRLPAKYQGAIGAKSAERAGAVEELGVAVERH 1893
Cdd:COG1196  532 VEAAYEAALEAALAAALQNIVVEDDEVAAAAIEYLKAAKAGRATFLPLDKIRARAALAAALARGAIGAAVDLVASDLREA 611
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464 1894 DGLLARLREDVVARVPWEMNADALGRLLAEFDALAEDLTPWAVDEFRGARALVQHRLGLYSAYAKARAQTGAGGTPPPAP 1973
Cdd:COG1196  612 DARYYVLGDTLLGRTLVAARLEAALRRAVTLAGRLREVTLEGEGGSAGGSLTGGSRRELLAALLEAEAELEELAERLAEE 691
                        330
                 ....*....|....*
gi 30984464 1974 APLLVDVRALEARAR 1988
Cdd:COG1196  692 ELELEEALLAEEEEE 706
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
480-559 2.67e-07

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 56.82  E-value: 2.67e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   480 PPTPAGRPPTPAGRPPTPANPTASSEPPTPAGRPPTPAGRPPTPANPTASSEPPTPNPEGAPAPSSNEQPPAAASTDEAT 559
Cdd:PRK12270   39 GSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAAPPAAAAAAAPAAAAVEDEVT 118
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
455-592 2.68e-07

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 56.53  E-value: 2.68e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   455 PTPAGPTTASSEPPTPAGPTTASSEPPTPAGRPPTPAGRPPTPANPTASSEPPTPAGRPPTPAGRPPTPANPTASSEPPT 534
Cdd:PRK07764  615 PAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPA 694
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 30984464   535 PNPEGAPAPSSNEQPPAAASTDEATQKALDALRDRQPPEPPCGSLTELLGRHPDTDGG 592
Cdd:PRK07764  695 GAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPA 752
PHA03291 PHA03291
envelope glycoprotein I; Provisional
452-576 2.78e-07

envelope glycoprotein I; Provisional


Pssm-ID: 223033 [Multi-domain]  Cd Length: 401  Bit Score: 55.73  E-value: 2.78e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   452 SEPPTPAGPTTASSEPptpAGPTTASSEPPTPAGRPPTPAGRPPTPANPTASSEPPTPAGRPP-TPAGRPPTPANPTASS 530
Cdd:PHA03291  173 AAPPLGEGSADGSCDP---ALPLSAPRLGPADVFVPATPRPTPRTTASPETTPTPSTTTSPPStTIPAPSTTIAAPQAGT 249
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*..
gi 30984464   531 EPPTPNPEGAPAPSSNEQPPAAAS-TDEATQKALDALRDRQPPEPPC 576
Cdd:PHA03291  250 TPEAEGTPAPPTPGGGEAPPANATpAPEASRYELTVTQIIQIAIPAS 296
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
461-553 5.28e-07

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 55.59  E-value: 5.28e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   461 TTASSEPPTPAGPTTASSEP----PTPAGRPPTPAGRPPTPANPTAssEPPTPAGRPPTPAGRPPTPANPTASSEPPTPN 536
Cdd:PRK14950  359 LLVPVPAPQPAKPTAAAPSPvrptPAPSTRPKAAAAANIPPKEPVR--ETATPPPVPPRPVAPPVPHTPESAPKLTRAAI 436
                          90
                  ....*....|....*..
gi 30984464   537 PEGAPAPSSNEQPPAAA 553
Cdd:PRK14950  437 PVDEKPKYTPPAPPKEE 453
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
452-606 6.11e-07

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 55.27  E-value: 6.11e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   452 SEPPTPAGPTTASSEPPTPAGPTTASSEPPTPAGRPPTPAGRPPTPA-NPTASSEPPTPAGRPPTPAGRPPTPANPTASS 530
Cdd:PRK12323  383 AQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPApEALAAARQASARGPGGAPAPAPAPAAAPAAAA 462
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 30984464   531 EPPTPNPEGAPAPSSNEQPPAAASTDEATQKALDALRDRQPPEPPCGSLTELLGRHPDTDG-GVSRLAAHEAGIARE 606
Cdd:PRK12323  463 RPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGWVAeSIPDPATADPDDAFE 539
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
403-568 6.46e-07

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 55.63  E-value: 6.46e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   403 RPAAGPRPPTPASRPPTPGAPPTPGAPPTPGAPPTPGAPPTPAGPTTASSEPPTPAG---------PTTASSEPPTPAGP 473
Cdd:PRK07003  380 VPAPGARAAAAVGASAVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPPATAdrgddaadgDAPVPAKANARASA 459
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   474 TTASSEPPTPAGRPPTPAGRPPTPANPTASSEPPTPAGRP-PTPAGRPPTPANPTASSEPPTPNPEGAPAPSSNEQPPAA 552
Cdd:PRK07003  460 DSRCDERDAQPPADSGSASAPASDAPPDAAFEPAPRAAAPsAATPAAVPDARAPAAASREDAPAAAAPPAPEARPPTPAA 539
                         170
                  ....*....|....*....
gi 30984464   553 ASTDE---ATQKALDALRD 568
Cdd:PRK07003  540 AAPAAragGAAAALDVLRN 558
PksD COG3321
Acyl transferase domain in polyketide synthase (PKS) enzymes [Secondary metabolites ...
781-1437 7.25e-07

Acyl transferase domain in polyketide synthase (PKS) enzymes [Secondary metabolites biosynthesis, transport and catabolism];


Pssm-ID: 442550 [Multi-domain]  Cd Length: 1386  Bit Score: 55.65  E-value: 7.25e-07
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464  781 LLQRIQALAGFARREEVRAAAEDREVRGALDALARGVDAVARRSG------PLTVAAVSPEEPGEGGGRPHPLSPEAIRV 854
Cdd:COG3321  702 LAARLEARGIRARRLPVSHAFHSPLMEPALEEFRAALAGVTPRAPriplisNVTGTWLTGEALDADYWVRHLRQPVRFAD 781
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464  855 RLEQLRADGQKA-VE-GAtreyfHRGAVYSAKALLAGDARDRRYHVASAPVVPVVQLLESL------------PAFDAHV 920
Cdd:COG3321  782 AVEALLADGVRVfLEvGP-----GPVLTGLVRQCLAAAGDAVVLPSLRRGEDELAQLLTALaqlwvagvpvdwSALYPGR 856
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464  921 QevARRARVPAPP------PLATSPPAELLRELVQRGRDLEAPADLAAWLASLGDAAGQGLVVRKELDELAQAIYKINER 994
Cdd:COG3321  857 G--RRRVPLPTYPfqredaAAALLAAALAAALAAAAALGALLLAALAAALAAALLALAAAAAAALALAAAALAALLALVA 934
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464  995 TVRRSSGLAELERFEALDAALRGELESEAAFEPGGGDGAAAGGLPAETRRLAEDALHQAKAMAAAKLTDELSPEARERLA 1074
Cdd:COG3321  935 LAAAAAALLALAAAAAAAAAALAAAEAGALLLLAAAAAAAAAAAAAAAAAAAAAAAAAAAALAAAAALALLAAAALLLAA 1014
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464 1075 ARVRAIEAMLEEARARAEAAKAALARFFQKLQGVLRPLPDFGGLRVAPAVLATLRADIPGGWTCLPDAAQAAPPEVRAAL 1154
Cdd:COG3321 1015 AAAAAALLALAALLAAAAAALAAAAAAAAAAAALAALAAAAAAAAALALALAALLLLAALAELALAAAALALAAALAAAA 1094
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464 1155 RADLWGLLGQYRGALEHPTADTAAALSGLHPNFAEVLRDLFPAAPETPLLVSFFSDHAPRVAQAVSEAIAAGSAAVATAS 1234
Cdd:COG3321 1095 LALALAALAAALLLLALLAALALAAAAAALLALAALLAAAAAAAALAAAAAAAAALALAAAAAALAAALAAALLAAAALL 1174
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464 1235 PESTVEAAVRAQGVLADTVAALSPAVRDPACPLAFLVALADSAAGYVKATRLALGARRAIARLGALGAAAADLAVAVRRE 1314
Cdd:COG3321 1175 LALALALAAALAAALAGLAALLLAALLAALLAALLALALAALAAAAAALLAAAAAAAALALLALAAAAAAVAALAAAAAA 1254
                        570       580       590       600       610       620       630       640
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464 1315 NPQADGDRAALLEAAARAVEAARAGLAACEGEFGGLLHAEGSAGDPSPSGRALQELGKVVAATRRRADELEAAAADLAEK 1394
Cdd:COG3321 1255 LLAALAALALLAAAAGLAALAAAAAAAAAALALAAAAAAAAAALAALLAAAAAAAAAAAAAAAAAALAAALLAAALAALA 1334
                        650       660       670       680
                 ....*....|....*....|....*....|....*....|...
gi 30984464 1395 LADRDARAGRERWAADVEAALDRVENRAEFDAVELRRLQALAA 1437
Cdd:COG3321 1335 AAVAAALALAAAAAAAAAAAAAAAAAAALAAAAGAAAAAAALA 1377
FAP pfam07174
Fibronectin-attachment protein (FAP); This family contains bacterial fibronectin-attachment ...
454-523 8.23e-07

Fibronectin-attachment protein (FAP); This family contains bacterial fibronectin-attachment proteins (FAP). Family members are rich in alanine and proline, are approximately 300 residues long, and seem to be restricted to mycobacteria. These proteins contain a fibronectin-binding motif that allows mycobacteria to bind to fibronectin in the extracellular matrix.


Pssm-ID: 429334  Cd Length: 301  Bit Score: 53.78  E-value: 8.23e-07
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464    454 PPTPAGPTTASSEPPTPAGPTTASSEPPTPAGRPPTPAGRPPTPANPTASSEPPTPAGRPPTPAGRPPTP 523
Cdd:pfam07174   42 EPAPPPPSTATAPPAPPPPPPAPAAPAPPPPPAAPNAPNAPPPPADPNAPPPPPADPNAPPPPAVDPNAP 111
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
451-628 8.51e-07

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 54.88  E-value: 8.51e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   451 SSEPPTPAGPTTASSEPPTPAGPTTASSEPPT--PAGRPPTPAGRPPTPANPTASSEPPTPAGRPPTPAGRPPTPANPTA 528
Cdd:PRK12323  400 AAPPAAPAAAPAAAAAARAVAAAPARRSPAPEalAAARQASARGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAA 479
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   529 ssePPTPNPEGAPAPSSNEQPPAAASTDEATQKALDALrDRQPPEPPCGSLTELLGRHPDTDGGVSRLAAHEAGIAREVT 608
Cdd:PRK12323  480 ---PARAAPAAAPAPADDDPPPWEELPPEFASPAPAQP-DAAPAGWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAA 555
                         170       180
                  ....*....|....*....|
gi 30984464   609 ECSRLTinALRSPFPGSPGL 628
Cdd:PRK12323  556 ATEPVV--APRPPRASASGL 573
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
451-529 1.01e-06

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 54.90  E-value: 1.01e-06
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 30984464   451 SSEPPTPAGPTTASSEPPTPAGPTTASSEPPTPAGRPPTPAGRPPTPANPTASSEPPTPAGRPPTPAGRPPTPANPTAS 529
Cdd:PRK12270   40 STAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAAPPAAAAAAAPAAAAVEDEVT 118
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
393-575 1.02e-06

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 54.61  E-value: 1.02e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   393 SRGSGGDEQTRPAAGPRPPTPASRPPTPGAPPTPGAPPTPGAPPTPGAPPTPAGPTTASSEPPTPAGPTTASSEPPTPAG 472
Cdd:PRK07764  591 APGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWP 670
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   473 PTTASSEPPTPAGRPPTPAGRPPTPANPTASSEPPTPAGRPPTPAGRPPTPANPTASSEPPTP-NPEGAPAPSSNEQPPA 551
Cdd:PRK07764  671 AKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPaADDPVPLPPEPDDPPD 750
                         170       180
                  ....*....|....*....|....
gi 30984464   552 AASTDEATQKALDALRDRQPPEPP 575
Cdd:PRK07764  751 PAGAPAQPPPPPAPAPAAAPAAAP 774
PHA03378 PHA03378
EBNA-3B; Provisional
455-575 2.09e-06

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 53.92  E-value: 2.09e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   455 PTPAGPTTASSEP----PTPAGPTTasSEPPTPAGRPPTPAGRPPTPANPTASsePPTPAGRPPTPAGR-PPTPANPTAS 529
Cdd:PHA03378  659 ITPYKPTWTQIGHipyqPSPTGANT--MLPIQWAPGTMQPPPRAPTPMRPPAA--PPGRAQRPAAATGRaRPPAAAPGRA 734
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*....
gi 30984464   530 SEP---PTPNPEGAPAPSSNEQPPAAASTDEATQKALDALRDRQPPEPP 575
Cdd:PHA03378  735 RPPaaaPGRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPPPQAP 783
PTZ00144 PTZ00144
dihydrolipoamide succinyltransferase; Provisional
470-534 2.28e-06

dihydrolipoamide succinyltransferase; Provisional


Pssm-ID: 240289 [Multi-domain]  Cd Length: 418  Bit Score: 53.15  E-value: 2.28e-06
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 30984464   470 PAGPTTASSEPPTPAGRPPTPAGRPPTPANPTASSEPPTPAGRPPTPAGRPPTPANPTASSEPPT 534
Cdd:PTZ00144  125 PAAAPAAAAAAKAEKTTPEKPKAAAPTPEPPAASKPTPPAAAKPPEPAPAAKPPPTPVARADPRE 189
Streccoc_I_II NF033804
antigen I/II family LPXTG-anchored adhesin; Members of the antigen I/II family are adhesins ...
453-552 2.87e-06

antigen I/II family LPXTG-anchored adhesin; Members of the antigen I/II family are adhesins with a glucan-binding domain, two types of repetitive regions, an isopeptide bond-forming domain associated with shear resistance, and a C-terminal LPXTG motif for anchoring to the cell wall. They occur in oral Streptococci, and tend to be major cell surface adhesins. Members of this family include SspA and SspB from Streptococcus gordonii, antigen I/II from S. mutans, etc.


Pssm-ID: 468188 [Multi-domain]  Cd Length: 1552  Bit Score: 53.41  E-value: 2.87e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   453 EPPTPAGPTTASSEPPTPAGPTTASSEPPTPAGRPPTPAgRPPTPANPTASSEP---PTPAG-------RPPTPAGRPPT 522
Cdd:NF033804  876 EPSKPEEPTYETEKPLEPAPVAPTYENEPTPPVKTPDQP-EPSKPEEPTYETEKplePAPVApsyenepTPPVKTPDQPE 954
                          90       100       110
                  ....*....|....*....|....*....|
gi 30984464   523 PANPTASSEPPTPNPEGAPAPSSNEQPPAA 552
Cdd:NF033804  955 PSKPVEPTYDPLPTPPVAPTPKQLPTPPAV 984
FAP pfam07174
Fibronectin-attachment protein (FAP); This family contains bacterial fibronectin-attachment ...
461-550 3.37e-06

Fibronectin-attachment protein (FAP); This family contains bacterial fibronectin-attachment proteins (FAP). Family members are rich in alanine and proline, are approximately 300 residues long, and seem to be restricted to mycobacteria. These proteins contain a fibronectin-binding motif that allows mycobacteria to bind to fibronectin in the extracellular matrix.


Pssm-ID: 429334  Cd Length: 301  Bit Score: 51.85  E-value: 3.37e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464    461 TTASSEPPTPAGPTTASSEPPtPAGRPPTPAGRPPTPANPTA--SSEPPTPAGRPPTPAGRPPTPANPTASSEPPT-PNP 537
Cdd:pfam07174   22 AVAGASAVAVALPAVAHADPE-PAPPPPSTATAPPAPPPPPPapAAPAPPPPPAAPNAPNAPPPPADPNAPPPPPAdPNA 100
                           90
                   ....*....|...
gi 30984464    538 EGAPAPSSNEQPP 550
Cdd:pfam07174  101 PPPPAVDPNAPEP 113
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
454-535 4.37e-06

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 52.97  E-value: 4.37e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   454 PPTPAGPTTASSEPPTPAGPTTASSEPPTPAGRPPTPAGRPPTPANPTASSEPPTPAGRPPTPAGRPPTPANPTASSEPP 533
Cdd:PRK12270   38 PGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAAPPAAAAAAAPAAAAVEDEV 117

                  ..
gi 30984464   534 TP 535
Cdd:PRK12270  118 TP 119
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
470-575 5.01e-06

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 52.68  E-value: 5.01e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   470 PAGPTTASSEPPTPAGRPPTPAGRPPTPANPTASSEPPTPAGRPPTPAGRPPTPANPTASSEPPTPNPEGAPAPSSNEQP 549
Cdd:PRK07764  386 GVAGGAGAPAAAAPSAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQ 465
                          90       100
                  ....*....|....*....|....*.
gi 30984464   550 PAAASTDEATQKALDALRDRQPPEPP 575
Cdd:PRK07764  466 PAPAPAAAPEPTAAPAPAPPAAPAPA 491
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
455-602 5.40e-06

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 52.30  E-value: 5.40e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   455 PTPAGPTTASSEPPTPAGPTTASSEPPTPAGrPPTPAGRPPTPANPTASSEPPTPAGRPPTPAGRPPTPANPTASSEP-P 533
Cdd:PRK07764  590 PAPGAAGGEGPPAPASSGPPEEAARPAAPAA-PAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGdG 668
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 30984464   534 TPNPEGAPAPSSNEQPPAAASTDEATQKALDALRDRQPPEPPCGSLTELLGRHPDTDGGVSRLAAHEAG 602
Cdd:PRK07764  669 WPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADD 737
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
397-598 7.78e-06

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 51.91  E-value: 7.78e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   397 GGDEQTRPAAGPRPPTPASRPPTPGAPPTPGapptpgapptpgapptpagpTTASSEPPTPAGPTTASSEPPTPAGPTTA 476
Cdd:PRK07764  589 GPAPGAAGGEGPPAPASSGPPEEAARPAAPA--------------------APAAPAAPAPAGAAAAPAEASAAPAPGVA 648
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   477 SSEPPTPAGRPPTPAGRPPTPANPTASSEPPTPAGRPPTPAGRPPTPANPTASSEPPTPNPEGAPAPSSNEQPPAAASTD 556
Cdd:PRK07764  649 APEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGA 728
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|..
gi 30984464   557 EATQKALDALRDRQPPEPPCGSLTELLGRHPDTDGGVSRLAA 598
Cdd:PRK07764  729 SAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAP 770
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
353-578 1.02e-05

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 51.71  E-value: 1.02e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   353 RPTWTPPSSLEDLSAGRHHPKRASLPTRTRRSARHAATPFSR---GSGGDEQTRPAAGPRPPTPASRPPTPGAPPTPGAP 429
Cdd:PHA03307  193 PPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSAADDAGAsssDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEA 272
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   430 PTPGAPPTPGAPPTPAGPTTASSEPPTPAGPTTASSEPPTPAGP------------TTASSEPPTPAGRPPTPA-GRPPT 496
Cdd:PHA03307  273 SGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSsssssresssssTSSSSESSRGAAVSPGPSpSRSPS 352
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   497 PANPTASSEPPTPAGRPPtPAGRPPTPANPTASSEPPTPNPEGAPAPSSNEQPPA-----AASTDEATQKALDALRDRQP 571
Cdd:PHA03307  353 PSRPPPPADPSSPRKRPR-PSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRfpagrPRPSPLDAGAASGAFYARYP 431

                  ....*..
gi 30984464   572 PEPPCGS 578
Cdd:PHA03307  432 LLTPSGE 438
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
356-569 1.07e-05

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 51.46  E-value: 1.07e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464    356 WTPPSSLEDLSAGRHHPKRASLPTRTRRSARHAATPFSRGSGgdeqTRPAAGPRPPTPASRPPTPGAPPTPGAPPTPGAP 435
Cdd:pfam05109  439 FAAPNTTTGLPSSTHVPTNLTAPASTGPTVSTADVTSPTPAG----TTSGASPVTPSPSPRDNGTESKAPDMTSPTSAVT 514
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464    436 PTPGAPPTPAGPTtaSSEPPTPAGPTTASSEPPTPAGPTTASSEPPTPAGRPPTPAGRPPT--PANPTASSEPPTPAGRP 513
Cdd:pfam05109  515 TPTPNATSPTPAV--TTPTPNATSPTLGKTSPTSAVTTPTPNATSPTPAVTTPTPNATIPTlgKTSPTSAVTTPTPNATS 592
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 30984464    514 PTpAGRPPTPANPTASSEPPTPNPEGAPAPSSNEQPPAAASTDEATQKALDALRDR 569
Cdd:pfam05109  593 PT-VGETSPQANTTNHTLGGTSSTPVVTSPPKNATSAVTTGQHNITSSSTSSMSLR 647
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
451-574 1.09e-05

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 51.61  E-value: 1.09e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   451 SSEPPTPAGPTTASSEPPTPAGPTTASSEPPTPAGRPPTPAGRPPTPANPTASSEPPTPAGRP--PTPAGRPPTPANPTA 528
Cdd:PTZ00449  517 SGLPPKAPGDKEGEEGEHEDSKESDEPKEGGKPGETKEGEVGKKPGPAKEHKPSKIPTLSKKPefPKDPKHPKDPEEPKK 596
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*..
gi 30984464   529 SSEPPTPN-PEGAPAPssneQPPAAASTDEATQKALDALRDRQPPEP 574
Cdd:PTZ00449  597 PKRPRSAQrPTRPKSP----KLPELLDIPKSPKRPESPKSPKRPPPP 639
PLN02217 PLN02217
probable pectinesterase/pectinesterase inhibitor
451-554 1.36e-05

probable pectinesterase/pectinesterase inhibitor


Pssm-ID: 215130 [Multi-domain]  Cd Length: 670  Bit Score: 50.86  E-value: 1.36e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   451 SSEPPTPAGPTTASSEPPTPAGPTTASSEPPTPAGRPPTP-AGRPPTPANPTASSEPPTPAGRPPTPAGRPPTPANPTAS 529
Cdd:PLN02217  563 AGNPGSTNSTPTGSAASSNTTFSSDSPSTVVAPSTSPPAGhLGSPPATPSKIVSPSTSPPASHLGSPSTTPSSPESSIKV 642
                          90       100
                  ....*....|....*....|....*
gi 30984464   530 SEPPTPNPEGAPAPSSNEQPPAAAS 554
Cdd:PLN02217  643 ASTETASPESSIKVASTESSVSMVS 667
PRK14948 PRK14948
DNA polymerase III subunit gamma/tau;
464-549 3.00e-05

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237862 [Multi-domain]  Cd Length: 620  Bit Score: 49.96  E-value: 3.00e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   464 SSEPPTPAGPTTASSEPPTPAgrPPTPAGRPPTPANPTASSEPPTPAGRPPTPAgrppTPANPTASSEPPTPNPEGAPAP 543
Cdd:PRK14948  515 GSASNTAKTPPPPQKSPPPPA--PTPPLPQPTATAPPPTPPPPPPTATQASSNA----PAQIPADSSPPPPIPEEPTPSP 588

                  ....*.
gi 30984464   544 SSNEQP 549
Cdd:PRK14948  589 TKDSSP 594
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
337-574 3.56e-05

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 49.85  E-value: 3.56e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   337 LPRPNQHYPLGFSKRRRPTWTPPSSLedlSAGRHHPKRASLPTRTRRSARHAATPfsRGSGGDEQTRPAAGPRPPTPASR 416
Cdd:PRK07003  380 VPAPGARAAAAVGASAVPAVTAVTGA---AGAALAPKAAAAAAATRAEAPPAAPA--PPATADRGDDAADGDAPVPAKAN 454
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   417 PPTPGAPPTPGAPPTPGAPPTPGAPPTPAGPTTASSEPPTPAGPTTASSEPPT--PAGPTTASSEPPTPAGRPPTPAGRP 494
Cdd:PRK07003  455 ARASADSRCDERDAQPPADSGSASAPASDAPPDAAFEPAPRAAAPSAATPAAVpdARAPAAASREDAPAAAAPPAPEARP 534
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   495 PTPANPTA----------------------SSEPPTPAGRPPTPAGRPPTPANPTASSEPPTPNPeGAPAPSSNEQPPAA 552
Cdd:PRK07003  535 PTPAAAAPaaraggaaaaldvlrnagmrvsSDRGARAAAAAKPAAAPAAAPKPAAPRVAVQVPTP-RARAATGDAPPNGA 613
                         250       260
                  ....*....|....*....|..
gi 30984464   553 ASTDEATQkaldalrDRQPPEP 574
Cdd:PRK07003  614 ARAEQAAE-------SRGAPPP 628
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
461-570 3.76e-05

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 49.39  E-value: 3.76e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   461 TTASSEPPTPAGPTTASSEPPT-PAGRPPTPAGRPPTPANPTASSEPPTPAGRP-PTPAGRPPT-PANPTASSE---PPT 534
Cdd:PRK14971  363 TQKGDDASGGRGPKQHIKPVFTqPAAAPQPSAAAAASPSPSQSSAAAQPSAPQSaTQPAGTPPTvSVDPPAAVPvnpPST 442
                          90       100       110
                  ....*....|....*....|....*....|....*.
gi 30984464   535 PNPEGAPAPSSNEQPPAAASTDEATQKALDALRDRQ 570
Cdd:PRK14971  443 APQAVRPAQFKEEKKIPVSKVSSLGPSTLRPIQEKA 478
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
458-558 4.13e-05

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 49.39  E-value: 4.13e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   458 AGPTTASSEPPTPAGPTTASSEPPTPAGRPPTPAGRPPTpanptASSEPPTPAGRPPTPAGRPPTPANPTASSEPPTPNP 537
Cdd:PRK14971  388 AAPQPSAAAAASPSPSQSSAAAQPSAPQSATQPAGTPPT-----VSVDPPAAVPVNPPSTAPQAVRPAQFKEEKKIPVSK 462
                          90       100
                  ....*....|....*....|...
gi 30984464   538 EGAPAPSSNE--QPPAAASTDEA 558
Cdd:PRK14971  463 VSSLGPSTLRpiQEKAEQATGNI 485
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
1659-2013 4.29e-05

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 49.55  E-value: 4.29e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464 1659 ARRDLGEAKDALVRAKQQRAEATDRVTAALREALAAHERQARSEAESLANLKTLLRVAAIPAAAAKTLEQA-----RSVA 1733
Cdd:COG1196  293 LLAELARLEQDIARLEERRRELEERLEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAeeallEAEA 372
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464 1734 EIVDQIELLLEQTEKAAELDQAAvdwLEHARRvfEAHPLTAARDGspdpLARLHARLDALGETRRRTAALRRSLEAAEAE 1813
Cdd:COG1196  373 ELAEAEEELEELAEELLEALRAA---AELAAQ--LEELEEAEEAL----LERLERLEEELEELEEALAELEEEEEEEEEA 443
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464 1814 WDEVWARFGRARggawkspEALGAAREQLRALQTATNTVLGLVADAHYPRLPAKYQGAigAKSAERAGAVEELGVAVERH 1893
Cdd:COG1196  444 LEEAAEEEAELE-------EEEEALLELLAELLEEAALLEAALAELLEELAEAAARLL--LLLEAEADYEGFLEGVKAAL 514
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464 1894 DGLLARLREDVVARVPWEMNADALGRLLAEFDALAEDLTPWAVDEFRGARALVQHRLGLYSAYAKARAQTGAGGTPPPAP 1973
Cdd:COG1196  515 LLAGLRGLAGAVAVLIGVEAAYEAALEAALAAALQNIVVEDDEVAAAAIEYLKAAKAGRATFLPLDKIRARAALAAALAR 594
                        330       340       350       360
                 ....*....|....*....|....*....|....*....|
gi 30984464 1974 APLLVDVRALEARARSPGERHEPDPRTVRGRGEAYLRARG 2013
Cdd:COG1196  595 GAIGAAVDLVASDLREADARYYVLGDTLLGRTLVAARLEA 634
PksD COG3321
Acyl transferase domain in polyketide synthase (PKS) enzymes [Secondary metabolites ...
378-895 4.55e-05

Acyl transferase domain in polyketide synthase (PKS) enzymes [Secondary metabolites biosynthesis, transport and catabolism];


Pssm-ID: 442550 [Multi-domain]  Cd Length: 1386  Bit Score: 49.49  E-value: 4.55e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464  378 PTRTRRSARHAATPFSRGSGGDEQTRPAAGPRPPTPASRPPTPGAPPTPGAPPTPGAPPTPGAPPTPAGPTTASSEPPTP 457
Cdd:COG3321  854 PGRGRRRVPLPTYPFQREDAAAALLAAALAAALAAAAALGALLLAALAAALAAALLALAAAAAAALALAAAALAALLALV 933
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464  458 AGPTTASSEPPTPAGPTTASSEPPTPAGRPPTPAGRPPTPANPTASSEPPTPAGRPPTPAGRPPTPANPTASSEPPTPNP 537
Cdd:COG3321  934 ALAAAAAALLALAAAAAAAAAALAAAEAGALLLLAAAAAAAAAAAAAAAAAAAAAAAAAAAALAAAAALALLAAAALLLA 1013
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464  538 EGAPAPSSNEQPPAAASTDEATQKALDALRDRQPPEPPCGSLTELLGRHPDTDGGVSRLAAHEAGIAREVTECSRLTINA 617
Cdd:COG3321 1014 AAAAAAALLALAALLAAAAAALAAAAAAAAAAAALAALAAAAAAAAALALALAALLLLAALAELALAAAALALAAALAAA 1093
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464  618 LRSPFPGSPGLLQHCIVFLFERVLAFLIENGARTHAGAGAEGPASGLLDLTVSLLPRRTAVGDFLASTRMTLADVAAHLP 697
Cdd:COG3321 1094 ALALALAALAAALLLLALLAALALAAAAAALLALAALLAAAAAAAALAAAAAAAAALALAAAAAALAAALAAALLAAAAL 1173
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464  698 LIQPVLDEGSIVGRLALAK-LVLVARDVIRTTDDFHGELAELERRLRATPPTEVYARLSEWLLERSKAGPDTLFAPATPT 776
Cdd:COG3321 1174 LLALALALAAALAAALAGLaALLLAALLAALLAALLALALAALAAAAAALLAAAAAAAALALLALAAAAAAVAALAAAAA 1253
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464  777 HPEPLLQRIQALAGFARREEVRAAAEDREVRGALDALARGVDAVARRSGPLTVAAVSPEEPGEGGGRPHPLSPEAIRVRL 856
Cdd:COG3321 1254 ALLAALAALALLAAAAGLAALAAAAAAAAAALALAAAAAAAAAALAALLAAAAAAAAAAAAAAAAAALAAALLAAALAAL 1333
                        490       500       510
                 ....*....|....*....|....*....|....*....
gi 30984464  857 EQLRADGQKAVEGATREYFHRGAVYSAKALLAGDARDRR 895
Cdd:COG3321 1334 AAAVAAALALAAAAAAAAAAAAAAAAAAALAAAAGAAAA 1372
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
395-554 4.79e-05

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 49.46  E-value: 4.79e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   395 GSGGDEQTRPAAGPRPPTPASRPPTPGAPPTPGAPPTPGAPPTPGAPPTPAGPTTASSEPPT--PAGPTTASSEPPTPAG 472
Cdd:PRK07003  366 GAPGGGVPARVAGAVPAPGARAAAAVGASAVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPaaPAPPATADRGDDAADG 445
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   473 PTTASSEPPTPAGRPPTPAGRPPTPANPTASSEPPTPAGRPPTPAGRPPTPANPTASSEPPTPNPEGAPAPSSNEQPPAA 552
Cdd:PRK07003  446 DAPVPAKANARASADSRCDERDAQPPADSGSASAPASDAPPDAAFEPAPRAAAPSAATPAAVPDARAPAAASREDAPAAA 525

                  ..
gi 30984464   553 AS 554
Cdd:PRK07003  526 AP 527
Pneumo_att_G pfam05539
Pneumovirinae attachment membrane glycoprotein G;
451-574 5.87e-05

Pneumovirinae attachment membrane glycoprotein G;


Pssm-ID: 114270 [Multi-domain]  Cd Length: 408  Bit Score: 48.51  E-value: 5.87e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464    451 SSE------PPTPAGPTTASSEPPTPAGP---TTASSEPPTPagRPPTpagrPPTPANPTASSEPPTPAGRPPTPAGRPP 521
Cdd:pfam05539  228 SNPepqtepPPSQRGPSGSPQHPPSTTSQdqsTTGDGQEHTQ--RRKT----PPATSNRRSPHSTATPPPTTKRQETGRP 301
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|...
gi 30984464    522 TPANPTASSEPPTPNPEGAPAPSSNEQppaaastdeaTQKALDAlRDRQPPEP 574
Cdd:pfam05539  302 TPRPTATTQSGSSPPHSSPPGVQANPT----------TQNLVDC-KELDPPKP 343
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
393-578 6.04e-05

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 49.01  E-value: 6.04e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   393 SRGSGGDEQTRPAA--GPRPPTPASRPPTPGAPPTPGAPPTPGAPPTPGAPPTPAGPTTASSEP-----------PTPAG 459
Cdd:PHA03307   13 AAAEGGEFFPRPPAtpGDAADDLLSGSQGQLVSDSAELAAVTVVAGAAACDRFEPPTGPPPGPGteapanesrstPTWSL 92
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   460 PTTASSEPPTPAGPTTASSEPPTPAGRPPTPAGRPPTPANPTASSEPPTPAGRPPTPAGRPPTPANPTASSEPPTPNPEG 539
Cdd:PHA03307   93 STLAPASPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQA 172
                         170       180       190
                  ....*....|....*....|....*....|....*....
gi 30984464   540 APAPSSNEQPPAAASTDEATQKALDALRDRQPPEPPCGS 578
Cdd:PHA03307  173 ALPLSSPEETARAPSSPPAEPPPSTPPAAASPRPPRRSS 211
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
493-573 7.46e-05

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 48.73  E-value: 7.46e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   493 RPPTPANPTASSEPPTPAGRPPTPAGRPPTPANPTASSEPPTPnPEGAPAPSSNEQPPAAASTDEATQKALDALRDRQPP 572
Cdd:PRK12270   37 GPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAA-PAAPPKPAAAAAAAAAPAAPPAAAAAAAPAAAAVED 115

                  .
gi 30984464   573 E 573
Cdd:PRK12270  116 E 116
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
487-565 7.78e-05

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 48.73  E-value: 7.78e-05
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 30984464   487 PPTPAGRPPTPANPTASSEPPTPAGRPPTPAGRPPTPANPTASSEPPTPNPEGAPAPSSneqPPAAASTDEATQKALDA 565
Cdd:PRK12270   40 STAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAA---PPAAAAAAAPAAAAVED 115
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
451-574 8.06e-05

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 48.32  E-value: 8.06e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   451 SSEPPTPAGPTTASSEPPTPAGPTTASSEPPTPAGRPPTPAGRPPTPANPTASSEPPTPAGRPPTPAGRPPTPANPTASS 530
Cdd:PRK07994  365 LPEPEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQRAQGATKAKKSEPA 444
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....
gi 30984464   531 EPPTPNPEGAPAPSSNEQPPAAASTDEATQKALDALRDRQPPEP 574
Cdd:PRK07994  445 AASRARPVNSALERLASVRPAPSALEKAPAKKEAYRWKATNPVE 488
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
452-555 8.65e-05

Family of unknown function (DUF5585); This is a family of unknown function found in Chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 48.03  E-value: 8.65e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464    452 SEPPTPAGP--TTASSEPPTPAG--PTTASSEPPTPAGRPPTPAGRPPTPANPTAssePPTPAGRPPTPAGRPPTPANPT 527
Cdd:pfam17823   98 SEPATREGAadGAASRALAAAASssPSSAAQSLPAAIAALPSEAFSAPRAAACRA---NASAAPRAAIAAASAPHAASPA 174
                           90       100
                   ....*....|....*....|....*...
gi 30984464    528 ASSEPPTPNPEGAPAPSSNEQPPAAAST 555
Cdd:pfam17823  175 PRTAASSTTAASSTTAASSAPTTAASSA 202
PRK12495 PRK12495
hypothetical protein; Provisional
452-540 8.99e-05

hypothetical protein; Provisional


Pssm-ID: 183558 [Multi-domain]  Cd Length: 226  Bit Score: 46.79  E-value: 8.99e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   452 SEPPTPAGPTTaSSEPPTPAGPTTASSEPPTPAGRPPTPAGRPPTPANPTASSEPPTPAGRPPTPAGRPPTPANPTASSE 531
Cdd:PRK12495   88 SDAGSQASPDD-DAQPAAEAEAADQSAPPEASSTSATDEAATDPPATAAARDGPTPDPTAQPATPDERRSPRQRPPVSGE 166

                  ....*....
gi 30984464   532 PPTPNPEGA 540
Cdd:PRK12495  167 PPTPSTPDA 175
DUF3729 pfam12526
Protein of unknown function (DUF3729); This family of proteins is found in viruses. Proteins ...
452-520 1.08e-04

Protein of unknown function (DUF3729); This family of proteins is found in viruses. Proteins in this family are typically between 145 and 1707 amino acids in length. The family is found in association with pfam01443, pfam01661, pfam05417, pfam01660, pfam00978. There is a single completely conserved residue L that may be functionally important.


Pssm-ID: 372164 [Multi-domain]  Cd Length: 115  Bit Score: 44.30  E-value: 1.08e-04
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 30984464    452 SEPPTPAGPTTASSEPPTPAGPTTASSEPPTPAGRPPTPAGRPPTPANPTASSEPPTPAGRPPTPAGRP 520
Cdd:pfam12526   38 DPPPPVGDPRPPVVDTPPPVSAVWVLPPPSEPAAPEPDLVPPVTGPAGPPSPLAPPAPAQKPPLPPPRP 106
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
404-578 1.09e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 48.31  E-value: 1.09e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   404 PAAGPRPPTP-ASRPPTPGAPPTPGAPPTPGAPPTPGAPPTPAGPTTASSEPPTPAGPTTASSEPPTPAGPTTASSEPPT 482
Cdd:PRK07003  368 PGGGVPARVAgAVPAPGARAAAAVGASAVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPPATADRGDDAADGDA 447
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   483 PAgRPPTPAGRPPTPANPTASSEPPTPAGRPPTPAgrPPTPANPTASSEPPT-PNPEGAPAPSSNEQPPAAASTDEATQK 561
Cdd:PRK07003  448 PV-PAKANARASADSRCDERDAQPPADSGSASAPA--SDAPPDAAFEPAPRAaAPSAATPAAVPDARAPAAASREDAPAA 524
                         170
                  ....*....|....*..
gi 30984464   562 ALDALRDRQPPEPPCGS 578
Cdd:PRK07003  525 AAPPAPEARPPTPAAAA 541
PHA03246 PHA03246
large tegument protein UL36; Provisional
3189-3287 1.10e-04

large tegument protein UL36; Provisional


Pssm-ID: 223020 [Multi-domain]  Cd Length: 3095  Bit Score: 48.43  E-value: 1.10e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464  3189 LGSLDPETDpetDPFAHPPDPRAPEAgDRASPSSYfgpppLSANAALSRRYVRSTGRSALAVLIEACWRIRRQLRMTRHA 3268
Cdd:PHA03246 3004 LSSTDSDSD---DSRSTVYNSNSTDT-DMSSTSRV-----IIADTLLTRRDFRKASRGALYALTKACEKIARQITQTRDQ 3074
                          90
                  ....*....|....*....
gi 30984464  3269 LLNRSGAVLTGLYHVRMLL 3287
Cdd:PHA03246 3075 LRSRVITLAIEIFKIKMLL 3093
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
486-564 1.14e-04

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 48.35  E-value: 1.14e-04
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 30984464   486 RPPTPAGRPPTPANPTASSEPPTPAGRPPTPAGRPPTPANPTASSEPPTPNPEGAPAPSSNEQPPAAASTDEATQKALD 564
Cdd:PRK12270   37 GPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAAPPAAAAAAAPAAAAVED 115
PRK14948 PRK14948
DNA polymerase III subunit gamma/tau;
465-546 1.47e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237862 [Multi-domain]  Cd Length: 620  Bit Score: 47.65  E-value: 1.47e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   465 SEPPTPAGPTTASSEPPTPAGRPPTPAGRPPTPAnPTASSEPPTPAgrPPTPAGRPPTPANPTASSEPPTPNPEGAPAPS 544
Cdd:PRK14948  361 PSAFISEIANASAPANPTPAPNPSPPPAPIQPSA-PKTKQAATTPS--PPPAKASPPIPVPAEPTEPSPTPPANAANAPP 437

                  ..
gi 30984464   545 SN 546
Cdd:PRK14948  438 SL 439
dnaA PRK14086
chromosomal replication initiator protein DnaA;
455-627 1.49e-04

chromosomal replication initiator protein DnaA;


Pssm-ID: 237605 [Multi-domain]  Cd Length: 617  Bit Score: 47.51  E-value: 1.49e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   455 PTPAGPTTASSEPPTPAGPTTASSEPPTPAGRPPTPAGRPPTP-ANPTASSE--PPTPAGRPPTPAGRPP--TPANPTAS 529
Cdd:PRK14086   98 PPPHARRTSEPELPRPGRRPYEGYGGPRADDRPPGLPRQDQLPtARPAYPAYqqRPEPGAWPRAADDYGWqqQRLGFPPR 177
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   530 SEPPTPNPEgAPAPSSNEQPPAAASTDEATQKAlDALRDRQPPEPPCGSLTELLGRHPDTDGgvsrlAAHEAGIAREVTE 609
Cdd:PRK14086  178 APYASPASY-APEQERDREPYDAGRPEYDQRRR-DYDHPRPDWDRPRRDRTDRPEPPPGAGH-----VHRGGPGPPERDD 250
                         170
                  ....*....|....*...
gi 30984464   610 CSRLTInalRSPFPGSPG 627
Cdd:PRK14086  251 APVVPI---RPSAPGPLA 265
PRK14948 PRK14948
DNA polymerase III subunit gamma/tau;
476-604 1.70e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237862 [Multi-domain]  Cd Length: 620  Bit Score: 47.26  E-value: 1.70e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   476 ASSEPPTPAGRPPTPAGRPPTPANPTASSEPPTPagrpPTPAGRPPTPANPTASSEPPTPNPegAPAPSSNEQPPAAAST 555
Cdd:PRK14948  361 PSAFISEIANASAPANPTPAPNPSPPPAPIQPSA----PKTKQAATTPSPPPAKASPPIPVP--AEPTEPSPTPPANAAN 434
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 30984464   556 DEAT-------QKALDALrdrqppEPPcgSLTELLGRHpdtdGGVSRLAAHEAGIA 604
Cdd:PRK14948  435 APPSlnleelwQQILAKL------ELP--STRMLLSQQ----AELVSLDSNRAVIA 478
SAV_2336_NTERM NF041121
SAV_2336 family N-terminal domain; This HMM describes an N-terminal domain shared by SAV_2336 ...
476-572 1.76e-04

SAV_2336 family N-terminal domain; This HMM describes an N-terminal domain shared by SAV_2336 (BAC70047.1) whose C-terminal region suggests restriction enzyme activity (PMID: 18456708), and with other proteins with unrelated C-terminal regions. A member protein was also identified in a kanamycin biosynthetic gene cluster (PMID: 16766657), while N-terminal regions of two other member proteins were named Trypco1 in a bioinformatic study (PMID: 32101166) of predicted bacterial conflict systems.


Pssm-ID: 469044 [Multi-domain]  Cd Length: 473  Bit Score: 46.92  E-value: 1.76e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   476 ASSEPPTPAGRPPTPAGRPPTPANPTASSEPPTPAGRPPTPAGRPPTPANPTASSEPPTPNPEGAPAPSSNEQP-----P 550
Cdd:NF041121   17 RAAAPPSPEGPAPTAASQPATPPPPAAPPSPPGDPPEPPAPEPAPLPAPYPGSLAPPPPPPPGPAGAAPGAALPvrvpaP 96
                          90       100
                  ....*....|....*....|..
gi 30984464   551 AAASTDEATQKALDALRDRQPP 572
Cdd:NF041121   97 PALPNPLELARALRPLKRRVPS 118
PTZ00436 PTZ00436
60S ribosomal protein L19-like protein; Provisional
458-558 1.82e-04

60S ribosomal protein L19-like protein; Provisional


Pssm-ID: 185616 [Multi-domain]  Cd Length: 357  Bit Score: 46.87  E-value: 1.82e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   458 AGPTTASSEPPTPAGPTTASSEPPTPAGRPPTPAGRPP--TPANPTASSEPPTPAGRPPTPAGRPPTPAnPTASSEPPTP 535
Cdd:PTZ00436  227 AAPAKAAAPPAKAAAAPAKAAAAPAKAAAPPAKAAAPPakAAAPPAKAAAPPAKAAAPPAKAAAPPAKA-AAAPAKAAAA 305
                          90       100
                  ....*....|....*....|...
gi 30984464   536 NPEGAPAPSSNEQPPAAASTDEA 558
Cdd:PTZ00436  306 PAKAAAAPAKAAAPPAKAAAPPA 328
PLN03209 PLN03209
translocon at the inner envelope of chloroplast subunit 62; Provisional
453-574 1.91e-04

translocon at the inner envelope of chloroplast subunit 62; Provisional


Pssm-ID: 178748 [Multi-domain]  Cd Length: 576  Bit Score: 47.23  E-value: 1.91e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   453 EPPTPAGPTTASSEPPTPaGPTTASSEPPTP-----AGRPPTPAGRPPTPAnPTASSEPPTPAGR-----PPT-PAGRPP 521
Cdd:PLN03209  381 KPPTSPIPTPPSSSPASS-KSVDAVAKPAEPdvvpsPGSASNVPEVEPAQV-EAKKTRPLSPYARyedlkPPTsPSPTAP 458
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 30984464   522 TPANPTA---SSEPPTPNPEGAPAPSSNEQPPAAASTDEATQKALDALRDRQPPEP 574
Cdd:PLN03209  459 TGVSPSVsstSSVPAVPDTAPATAATDAAAPPPANMRPLSPYAVYDDLKPPTSPSP 514
PRK10856 PRK10856
cytoskeleton protein RodZ;
474-563 1.96e-04

cytoskeleton protein RodZ;


Pssm-ID: 236776 [Multi-domain]  Cd Length: 331  Bit Score: 46.56  E-value: 1.96e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   474 TTASSEPPTPAGRPPTPAGRPPTPANPTASSEPPTPAGRPPTPAGRPPTPAN--PTASSEPPTPNPEGAPAPSSNEQppA 551
Cdd:PRK10856  165 DTSTTTDPATTPAPAAPVDTTPTNSQTPAVATAPAPAVDPQQNAVVAPSQANvdTAATPAPAAPATPDGAAPLPTDQ--A 242
                          90
                  ....*....|..
gi 30984464   552 AASTDEATQKAL 563
Cdd:PRK10856  243 GVSTPAADPNAL 254
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
453-534 2.05e-04

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 47.19  E-value: 2.05e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   453 EPPTPAGPTTASSEPPTPAGPTTASSEPPTPAGRPPTPAgrPPTPANPTASSEPPTPAGRPPT------PAGRppTPANP 526
Cdd:PRK12270   56 SAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAA--APAAPPAAAAAAAPAAAAVEDEvtplrgAAAA--VAKNM 131

                  ....*...
gi 30984464   527 TASSEPPT 534
Cdd:PRK12270  132 DASLEVPT 139
PRK14954 PRK14954
DNA polymerase III subunits gamma and tau; Provisional
455-567 2.06e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 184918 [Multi-domain]  Cd Length: 620  Bit Score: 47.25  E-value: 2.06e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   455 PTPAGPTTASSEPPTPAGPTTASSEPPTpagRPPTPAGRPPTPANPTASsepPTPAGRPPTPagrPPTPANPTASSEPPT 534
Cdd:PRK14954  382 PSPAGSPDVKKKAPEPDLPQPDRHPGPA---KPEAPGARPAELPSPASA---PTPEQQPPVA---RSAPLPPSPQASAPR 452
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....
gi 30984464   535 PNPEGAP---------------------APSSNEQPPAAASTDEATQKALDALR 567
Cdd:PRK14954  453 NVASGKPgvdlgswqgkfmnftrngsrkQPVQASSSDAAQTGVFEGVAELEKLR 506
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
453-574 2.07e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 47.15  E-value: 2.07e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   453 EPPTPAGPTTASSEPPTPAGPTTASSEPPTPAGRPPTPAGRPPTPANPTASSEPPTPAGRPPTPAGRPPTPANPTASSE- 531
Cdd:PRK07003  359 EPAVTGGGAPGGGVPARVAGAVPAPGARAAAAVGASAVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPPATADr 438
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*
gi 30984464   532 --PPTPNPEGAPAPSSNEQPPAAASTDEATQKALDALRDRQPPEP 574
Cdd:PRK07003  439 gdDAADGDAPVPAKANARASADSRCDERDAQPPADSGSASAPASD 483
PLN03209 PLN03209
translocon at the inner envelope of chloroplast subunit 62; Provisional
451-579 2.12e-04

translocon at the inner envelope of chloroplast subunit 62; Provisional


Pssm-ID: 178748 [Multi-domain]  Cd Length: 576  Bit Score: 46.84  E-value: 2.12e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   451 SSEPPTPAGP--TTASSEPPTPAGPTTASSEPPTPAGRPPTPAGRPPTPANPTASSEPPTpagrPPTPAGRPPTPANPTA 528
Cdd:PLN03209  451 TSPSPTAPTGvsPSVSSTSSVPAVPDTAPATAATDAAAPPPANMRPLSPYAVYDDLKPPT----SPSPAAPVGKVAPSST 526
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....
gi 30984464   529 SSEPPTPNPEGAPAPSSNEQppaaasTDEATQKALDAL---RDRQPPEPPCGSL 579
Cdd:PLN03209  527 NEVVKVGNSAPPTALADEQH------HAQPKPRPLSPYtmyEDLKPPTSPTPSP 574
PLN03209 PLN03209
translocon at the inner envelope of chloroplast subunit 62; Provisional
452-575 2.64e-04

translocon at the inner envelope of chloroplast subunit 62; Provisional


Pssm-ID: 178748 [Multi-domain]  Cd Length: 576  Bit Score: 46.84  E-value: 2.64e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   452 SEPPTPAGPTTAssEPPTPAGPTTASSEPPTPAGR--PPTPAGRPPTPANPTASSE---PPTpAGRPPTPAGRPPTPANP 526
Cdd:PLN03209  325 SQRVPPKESDAA--DGPKPVPTKPVTPEAPSPPIEeePPQPKAVVPRPLSPYTAYEdlkPPT-SPIPTPPSSSPASSKSV 401
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|..
gi 30984464   527 TASSEPPTPNPEGAPAPSSNEQPPAAASTDEATQKALD---ALRDRQPPEPP 575
Cdd:PLN03209  402 DAVAKPAEPDVVPSPGSASNVPEVEPAQVEAKKTRPLSpyaRYEDLKPPTSP 453
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
457-575 2.69e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 46.78  E-value: 2.69e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   457 PAGPTTASSEPPTPAGPTTASSEPPTPAGRPPTPAGRPPTPANPTASSEPPTPAGRPPTPAgrPPTPANPTASSEPPTPN 536
Cdd:PRK07994  361 PAAPLPEPEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQ--LLAARQQLQRAQGATKA 438
                          90       100       110
                  ....*....|....*....|....*....|....*....
gi 30984464   537 PEGAPAPSSNEQPPAAASTDEAtqkALDALRDRQPPEPP 575
Cdd:PRK07994  439 KKSEPAAASRARPVNSALERLA---SVRPAPSALEKAPA 474
flhF PRK06995
flagellar biosynthesis protein FlhF;
456-561 2.86e-04

flagellar biosynthesis protein FlhF;


Pssm-ID: 235904 [Multi-domain]  Cd Length: 484  Bit Score: 46.50  E-value: 2.86e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   456 TPAGPTTASSEPPTPAGPTTASSEPPTPAGRPPTPAGRPPTpaNPTASSEPPTP-AGRPPTPAGRPPTPANPTASSEPPT 534
Cdd:PRK06995   52 APPAAAAPAAAQPPPAAAPAAVSRPAAPAAEPAPWLVEHAK--RLTAQREQLVArAAAPAAPEAQAPAAPAERAAAENAA 129
                          90       100
                  ....*....|....*....|....*....
gi 30984464   535 PNPEGAP--APSSNEQPPAAASTDEATQK 561
Cdd:PRK06995  130 RRLARAAaaAPRPRVPADAAAAVADAVKA 158
PRK13042 PRK13042
superantigen-like protein SSL4; Reviewed;
461-538 3.01e-04

superantigen-like protein SSL4; Reviewed;


Pssm-ID: 183854 [Multi-domain]  Cd Length: 291  Bit Score: 45.78  E-value: 3.01e-04
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 30984464   461 TTASSEPPTPAGPTTASSEPPTPAGRPPTPAGRPPTPANPTASSEPPTPAGRPPTP-AGRPPTPANPTASSEPPTPNPE 538
Cdd:PRK13042   24 TTQAANATTPSSTKVEAPQSTPPSTKVEAPQSKPNATTPPSTKVEAPQQTPNATTPsSTKVETPQSPTTKQVPTEINPK 102
PLN02983 PLN02983
biotin carboxyl carrier protein of acetyl-CoA carboxylase
463-550 3.16e-04

biotin carboxyl carrier protein of acetyl-CoA carboxylase


Pssm-ID: 215533 [Multi-domain]  Cd Length: 274  Bit Score: 45.60  E-value: 3.16e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   463 ASSEPPTPAgpTTASSEPPTPAGRPPTPagrpPTPANPTASSEPPTPAGRPPTPAgrpptPANPTASSEPPTPNPEG--- 539
Cdd:PLN02983  140 ALPQPPPPA--PVVMMQPPPPHAMPPAS----PPAAQPAPSAPASSPPPTPASPP-----PAKAPKSSHPPLKSPMAgtf 208
                          90
                  ....*....|...
gi 30984464   540 --APAPSsneQPP 550
Cdd:PLN02983  209 yrSPAPG---EPP 218
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
404-569 3.47e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 46.52  E-value: 3.47e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   404 PAAGPRPPTPASRPPTpgapptpgapptpgapptpgapptpagpttASSEPPTPAGPTTASSEPPTPAGPTTA-SSEPPT 482
Cdd:PRK07764  392 GAPAAAAPSAAAAAPA------------------------------AAPAPAAAAPAAAAAPAPAAAPQPAPApAPAPAP 441
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   483 PAGRPPTPAGRPPTPANPTASSEPPTPAGRPPTPAGRPPTPANPTASSEPPTPNPEGAPAPSSNeqPPAAASTDEATQKA 562
Cdd:PRK07764  442 PSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAG--ADDAATLRERWPEI 519

                  ....*..
gi 30984464   563 LDALRDR 569
Cdd:PRK07764  520 LAAVPKR 526
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
502-583 3.48e-04

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 46.42  E-value: 3.48e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   502 ASSEPPTPAGRPPTPAGRPPTPANPTASSEPPTPNPEGAPAPSSNEQPPAAASTDEATQKALDALRDRQPPEPPCGSLTE 581
Cdd:PRK12270   39 GSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAAPPAAAAAAAPAAAAVEDEVT 118

                  ..
gi 30984464   582 LL 583
Cdd:PRK12270  119 PL 120
PHA02682 PHA02682
ORF080 virion core protein; Provisional
461-571 3.49e-04

ORF080 virion core protein; Provisional


Pssm-ID: 177464 [Multi-domain]  Cd Length: 280  Bit Score: 45.62  E-value: 3.49e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   461 TTASSEPPTPAGPTTASSEPPTPAGRPPTPAGRPPTPANPTASSEPPTPAGRPPT-------PAGRPPTPANPTASSE-- 531
Cdd:PHA02682   69 NSACMQRPSGQSPLAPSPACAAPAPACPACAPAAPAPAVTCPAPAPACPPATAPTcpppavcPAPARPAPACPPSTRQcp 148
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|.
gi 30984464   532 --PPTPNPEGAPAP---------SSNEQPPAAASTDEATQKALDALRDRQP 571
Cdd:PHA02682  149 paPPLPTPKPAPAAkpiflhnqlPPPDYPAASCPTIETAPAASPVLEPRIP 199
Pro-rich pfam15240
Proline-rich protein; This family includes several eukaryotic proline-rich proteins.
454-526 3.53e-04

Proline-rich protein; This family includes several eukaryotic proline-rich proteins.


Pssm-ID: 464580 [Multi-domain]  Cd Length: 167  Bit Score: 43.87  E-value: 3.53e-04
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 30984464    454 PPTPAGPTTASSE---PPTPAGPTTASS-EPPTPAGRPPTPAGRPPTPANPtaSSEPPTPAGRPPTPAGRPPTPANP 526
Cdd:pfam15240   88 PPPQGGPRPPPGKpqgPPPQGGNQQQGPpPPGKPQGPPPQGGGPPPQGGNQ--QGPPPPPPGNPQGPPQRPPQPGNP 162
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
466-575 3.58e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 46.38  E-value: 3.58e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   466 EPPTPAGPTTASSEPPTPAGRPPTPAGRPPTPANPTASSEPPTPAGRPPTPAGrpPTPANPTASSEPPTPNPEGAPAPSS 545
Cdd:PRK07003  359 EPAVTGGGAPGGGVPARVAGAVPAPGARAAAAVGASAVPAVTAVTGAAGAALA--PKAAAAAAATRAEAPPAAPAPPATA 436
                          90       100       110
                  ....*....|....*....|....*....|....*
gi 30984464   546 NE-----QPPAAASTDEATQKALDALRDRQPPEPP 575
Cdd:PRK07003  437 DRgddaaDGDAPVPAKANARASADSRCDERDAQPP 471
DUF4045 pfam13254
Domain of unknown function (DUF4045); This presumed domain is functionally uncharacterized. ...
451-575 3.59e-04

Domain of unknown function (DUF4045); This presumed domain is functionally uncharacterized. This domain family is found in bacteria and eukaryotes, and is typically between 384 and 430 amino acids in length.


Pssm-ID: 433066 [Multi-domain]  Cd Length: 415  Bit Score: 45.93  E-value: 3.59e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464    451 SSEPPTPAGPTTASSEPPTPAGPTTASSEP-PTPAGRPPTPagrPPTPANPTASSEPPTPAGRPPTPagrpPTPANPTAS 529
Cdd:pfam13254  221 PSVSGISADSSPTKEEPSEEADTLSTDKEQsPAPTSASEPP---PKTKELPKDSEEPAAPSKSAEAS----TEKKEPDTE 293
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*..
gi 30984464    530 SEPPTPNPEGAPAPSSneqPPAAASTDE-ATQKALDALRDRQPPEPP 575
Cdd:pfam13254  294 SSPETSSEKSAPSLLS---PVSKASIDKpLSSPDRDPLSPKPKPQSP 337
PRK10856 PRK10856
cytoskeleton protein RodZ;
455-546 3.71e-04

cytoskeleton protein RodZ;


Pssm-ID: 236776 [Multi-domain]  Cd Length: 331  Bit Score: 45.79  E-value: 3.71e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   455 PTPAGPTTASSEPPTPAGPTTASSEpPTPAGRPPTPAGRPPTPANPTASSEPPTPAGRPPTPAGRPPTPANPTASSEPPT 534
Cdd:PRK10856  161 SVPLDTSTTTDPATTPAPAAPVDTT-PTNSQTPAVATAPAPAVDPQQNAVVAPSQANVDTAATPAPAAPATPDGAAPLPT 239
                          90
                  ....*....|...
gi 30984464   535 PN-PEGAPAPSSN 546
Cdd:PRK10856  240 DQaGVSTPAADPN 252
Pro-rich pfam15240
Proline-rich protein; This family includes several eukaryotic proline-rich proteins.
453-578 3.95e-04

Proline-rich protein; This family includes several eukaryotic proline-rich proteins.


Pssm-ID: 464580 [Multi-domain]  Cd Length: 167  Bit Score: 43.87  E-value: 3.95e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464    453 EPPTPAGPTTASSEPPTPAGPTTASSEPPTPAGRPPTPAGRPPTPANPTASSEPPTPAGRPPTPAGRPPTPANPTASSEP 532
Cdd:pfam15240   36 EGQSQQGGQGPQGPPPGGFPPQPPASDDPPGPPPPGGPQQPPPQGGKQKPQGPPPQGGPRPPPGKPQGPPPQGGNQQQGP 115
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*.
gi 30984464    533 PTPNPEGAPAPSSNEQPPAAASTdeatqkaldalrdRQPPEPPCGS 578
Cdd:pfam15240  116 PPPGKPQGPPPQGGGPPPQGGNQ-------------QGPPPPPPGN 148
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
358-584 3.98e-04

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 46.32  E-value: 3.98e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   358 PPSSLEDLSAGRHHPKRASLPTRTRRSARHAATPFsrGSGGDEQTRPAAGPRPPTPASRPPTPGAPPTPGAPPtpgappt 437
Cdd:PHA03307   31 AADDLLSGSQGQLVSDSAELAAVTVVAGAAACDRF--EPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPA------- 101
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   438 pgaPPTPAGPTTASSEPPTPAGPTTASSEPPTPAGPTTASSEPPTPAGRPPTPAGRPPTPANPTASSEPPTPAGRPPTPA 517
Cdd:PHA03307  102 ---REGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSS 178
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 30984464   518 GRPPTPANPTASSEPPT-PNPEGAPAPSSNEQPPAAASTDEATQKALDALRDRQPPEPPCGSLTELLG 584
Cdd:PHA03307  179 PEETARAPSSPPAEPPPsTPPAAASPRPPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSG 246
PRK14948 PRK14948
DNA polymerase III subunit gamma/tau;
475-557 5.14e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237862 [Multi-domain]  Cd Length: 620  Bit Score: 45.72  E-value: 5.14e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   475 TASSEPPTPAGRPPTPAGRPPTPAnPTASSEPPTPAGRPPTPAGRPPtPANPTASSEPPTPNPEGAPAPSSNEQPPAAAS 554
Cdd:PRK14948  512 SQSGSASNTAKTPPPPQKSPPPPA-PTPPLPQPTATAPPPTPPPPPP-TATQASSNAPAQIPADSSPPPPIPEEPTPSPT 589

                  ...
gi 30984464   555 TDE 557
Cdd:PRK14948  590 KDS 592
PRK10856 PRK10856
cytoskeleton protein RodZ;
471-554 5.17e-04

cytoskeleton protein RodZ;


Pssm-ID: 236776 [Multi-domain]  Cd Length: 331  Bit Score: 45.02  E-value: 5.17e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   471 AGPTTASSEPPTPAGRPPTPAGRPPTPANPTASSEPPTPAGRPPTPAG-RPPTPANPTASSEPPTPNPegAPAPSSNEQP 549
Cdd:PRK10856  169 TTDPATTPAPAAPVDTTPTNSQTPAVATAPAPAVDPQQNAVVAPSQANvDTAATPAPAAPATPDGAAP--LPTDQAGVST 246

                  ....*
gi 30984464   550 PAAAS 554
Cdd:PRK10856  247 PAADP 251
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
454-545 5.53e-04

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 45.53  E-value: 5.53e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   454 PPTPAGPTTassepPTPAGPTTASSEPPTPAGRPPTPagrPPTPANPTASSEPPTPAgrpPTPAGRPPTPaNPTASSEPP 533
Cdd:NF033839  281 QDTPKEPGN-----KKPSAPKPGMQPSPQPEKKEVKP---EPETPKPEVKPQLEKPK---PEVKPQPEKP-KPEVKPQLE 348
                          90
                  ....*....|..
gi 30984464   534 TPNPEGAPAPSS 545
Cdd:NF033839  349 TPKPEVKPQPEK 360
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
338-526 5.73e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 45.64  E-value: 5.73e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   338 PRPNQHYPLGFSKRRRPTWTPPSSLEDLSAGRHHPKRASLPTRTRRSARHAATPFSRGSGGDEQTRPAAGPRPPTPASRP 417
Cdd:PRK12323  385 PAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAPAPAPAPAAAPAAAARP 464
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   418 PTPGAPPTPGapptPGAPPTPGAPPTPAGPTTASSEPP---TPAGPTTASSEPPTPAGPTTASSEPPTPAGRPPTPAGRP 494
Cdd:PRK12323  465 AAAGPRPVAA----AAAAAPARAAPAAAPAPADDDPPPweeLPPEFASPAPAQPDAAPAGWVAESIPDPATADPDDAFET 540
                         170       180       190
                  ....*....|....*....|....*....|..
gi 30984464   495 PTPAnPTASSEPPTPAGRPPTPAGRPPTPANP 526
Cdd:PRK12323  541 LAPA-PAAAPAPRAAAATEPVVAPRPPRASAS 571
PRK14948 PRK14948
DNA polymerase III subunit gamma/tau;
451-532 6.68e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237862 [Multi-domain]  Cd Length: 620  Bit Score: 45.34  E-value: 6.68e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   451 SSEPPTPAGPTTASSEPPTPAGPTTASSEPPTPAGRPPTPAGRPPTPANPTASSEPPTPAGRPPTPAGRPPTPANPTASS 530
Cdd:PRK14948  513 QSGSASNTAKTPPPPQKSPPPPAPTPPLPQPTATAPPPTPPPPPPTATQASSNAPAQIPADSSPPPPIPEEPTPSPTKDS 592

                  ..
gi 30984464   531 EP 532
Cdd:PRK14948  593 SP 594
PRK10905 PRK10905
cell division protein DamX; Validated
461-562 7.16e-04

cell division protein DamX; Validated


Pssm-ID: 236792 [Multi-domain]  Cd Length: 328  Bit Score: 44.54  E-value: 7.16e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   461 TTASSEPPTPAgPTTASSEPPTPAGRPPTPAGRPPTPANPTASSEPPTPAGRPPTPagrPPTPANPTASSEPPTPN-PEG 539
Cdd:PRK10905  121 STLPTEPATVA-PVRNGNASRQTAKTQTAERPATTRPARKQAVIEPKKPQATAKTE---PKPVAQTPKRTEPAAPVaSTK 196
                          90       100
                  ....*....|....*....|...
gi 30984464   540 APAPSSNEQPPAAASTDEATQKA 562
Cdd:PRK10905  197 APAATSTPAPKETATTAPVQTAS 219
MISS pfam15822
MAPK-interacting and spindle-stabilising protein-like; MISS is a family of eukaryotic ...
376-554 7.42e-04

MAPK-interacting and spindle-stabilising protein-like; MISS is a family of eukaryotic MAPK-interacting and spindle-stabilising protein-like proteins. MISS is rich in prolines and has four potential MAPK-phosphorylation sites, a MAPK-docking site, a PEST sequence (PEST motif) and a bipartite nuclear localization signal. The endogenous protein accumulates during mouse meiotic maturation and is found as discrete dots on the MII spindle. MISS is the first example of a physiological MAPK substrate that is stabilized in MII and specifically regulates MII spindle integrity during CSF arrest.


Pssm-ID: 318115 [Multi-domain]  Cd Length: 238  Bit Score: 44.21  E-value: 7.42e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464    376 SLPTRTRRSARHAATPFSRGSGGDEQTRPAAGPRPPTPASRPPTPGAPPTPGAPPTPGAPPTPGAPPTPAGPTTASSEPP 455
Cdd:pfam15822   44 AVPSGLPPSTAPSTVPFGPAPTGMYPSIPLTGPSPGPPAPFPPSGPSCPPPGGPYPAPTVPGPGPIGPYPTPNMPFPELP 123
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464    456 TPAGPTT--ASSEPPTPAGPTTASSEPPTPAGRPPTPAGRPPTPANPTASSEPPTPAGRPPTPAGR-PPTPANPTASSEP 532
Cdd:pfam15822  124 RPYGAPTdpAAAAPSGPWGSMSSGPWAPGMGGQYPAPNMPYPSPGPYPAVPPPQSPGAAPPVPWGTvPPGPWGPPAPYPD 203
                          170       180
                   ....*....|....*....|....*
gi 30984464    533 PT---PNPEGAPAPSSNEQPPAAAS 554
Cdd:pfam15822  204 PTgsyPMPGLYPTPNNPFQVPSGPS 228
Pneumo_att_G pfam05539
Pneumovirinae attachment membrane glycoprotein G;
451-561 7.48e-04

Pneumovirinae attachment membrane glycoprotein G;


Pssm-ID: 114270 [Multi-domain]  Cd Length: 408  Bit Score: 45.04  E-value: 7.48e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464    451 SSEPPTPAGPTTASSEPPTPAGPTTASSeppTPAGRPPTPAGRPPTPANPTASSEP-PTPAGRPPTPAGRPPTPA---NP 526
Cdd:pfam05539  184 VSHPTYPSQVTPQSQPATQGHQTATANQ---RLSSTEPVGTQGTTTSSNPEPQTEPpPSQRGPSGSPQHPPSTTSqdqST 260
                           90       100       110
                   ....*....|....*....|....*....|....*
gi 30984464    527 TASSEPPTPNPEGAPAPSSNEQPPAAASTDEATQK 561
Cdd:pfam05539  261 TGDGQEHTQRRKTPPATSNRRSPHSTATPPPTTKR 295
PTZ00436 PTZ00436
60S ribosomal protein L19-like protein; Provisional
456-565 7.49e-04

60S ribosomal protein L19-like protein; Provisional


Pssm-ID: 185616 [Multi-domain]  Cd Length: 357  Bit Score: 44.94  E-value: 7.49e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   456 TPAGPTTASSEPPTPAGPTTASSEPPTPAGRPPTPAGRPPTPAN--PTASSEPPTPAGRPPTPAGRPPTPANPTASSEPP 533
Cdd:PTZ00436  232 AAAPPAKAAAAPAKAAAAPAKAAAPPAKAAAPPAKAAAPPAKAAapPAKAAAPPAKAAAPPAKAAAAPAKAAAAPAKAAA 311
                          90       100       110
                  ....*....|....*....|....*....|..
gi 30984464   534 TPNPEGAPaPSSNEQPPAAASTDEATQKALDA 565
Cdd:PTZ00436  312 APAKAAAP-PAKAAAPPAKAATPPAKAAAPPA 342
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
451-558 9.06e-04

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 44.95  E-value: 9.06e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464    451 SSEPPTPAGPTTASSEPPT---PAGPTTASSEPPTPAGRPPTPAG-RPPTPANPTASSEPPTPAGRPPTPAGRPP----T 522
Cdd:pfam17823  128 QSLPAAIAALPSEAFSAPRaaaCRANASAAPRAAIAAASAPHAASpAPRTAASSTTAASSTTAASSAPTTAASSApatlT 207
                           90       100       110
                   ....*....|....*....|....*....|....*.
gi 30984464    523 PANPTASSEPPTPNPEGAPAPSSNEQPPAAASTDEA 558
Cdd:pfam17823  208 PARGISTAATATGHPAAGTALAAVGNSSPAAGTVTA 243
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
465-554 9.46e-04

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 44.91  E-value: 9.46e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464    465 SEPPTPAGPTTASSEPPTPAGRP-----PTPAGRPPT--PANPTASSEPPTPAG-----RPPTPAGRP---------PTP 523
Cdd:pfam05109  427 STTTSPTLNTTGFAAPNTTTGLPssthvPTNLTAPAStgPTVSTADVTSPTPAGttsgaSPVTPSPSPrdngteskaPDM 506
                           90       100       110
                   ....*....|....*....|....*....|.
gi 30984464    524 ANPTASSEPPTPNPEgAPAPSSNEQPPAAAS 554
Cdd:pfam05109  507 TSPTSAVTTPTPNAT-SPTPAVTTPTPNATS 536
COG3903 COG3903
Predicted ATPase [General function prediction only];
1573-1990 9.93e-04

Predicted ATPase [General function prediction only];


Pssm-ID: 443109 [Multi-domain]  Cd Length: 933  Bit Score: 45.01  E-value: 9.93e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464 1573 ELCDRLDALRADVHRALGGVPLDLAAAAEQTVRLRGDPAAAAELVRTGVTLACPSEDALAACVGALERVDQAPVKDTAYA 1652
Cdd:COG3903  524 EHDNLRAALRWALAHGDAELALRLAAALAPFWFLRGLLREGRRWLERALAAAGEAAAALAAAAALAAAAAAARAAAAAAA 603
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464 1653 EHVAFVARRDLGEAKDALVRAKQQRAEATDRVTAALREALAAHERQARSEAESLANLKTLLRVAAIPAAAAKTLEQARSV 1732
Cdd:COG3903  604 AAAAAAAAAAAAAAAAAAALLLLAALAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAALAAAAAAAAAAAAAAAAA 683
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464 1733 AEivdqiellLEQTEKAAELDQAAVDWLEHARRVFEAHPLTAARDGSPDPLARLHARLDALGETRRRTAALRRSLEAAEA 1812
Cdd:COG3903  684 AA--------ALAAAAAALAAAAAAAALAAAAAAALAAAAAAAAAAAAAAALLAAAAAAALAAAAAAAALALAAAAAAAA 755
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464 1813 EWDEVWARFGRARGGAWKSPEALGAAREQLRALQTATNTVLGLVADAHYPRLPAKYQGAIGAKSAERAGAVEELGVAVER 1892
Cdd:COG3903  756 AAAAAAALAAAAAAAALAALLLALAAAAAALAAAAAAAAAAAAAAAAAAAAAAAAAAAAALAAAAAAAAAAAAALAAALA 835
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464 1893 HDGLLARLREDVVARVPWEMNADALGRLLAEFDALAEDLTPWAVDEFRGARALVQHRLGLYSAYAKARAQTGAGGTPPPA 1972
Cdd:COG3903  836 AAAAAAAAAAAAAAAAAALAAALAAAAAAAAAAALAAAAAAAAAAAAALLAAAAAAAAAAAAAAAAAAALAAAAAAAAAA 915
                        410
                 ....*....|....*...
gi 30984464 1973 PAPLLVDVRALEARARSP 1990
Cdd:COG3903  916 ALAAAAAAAAAAAAAAAA 933
PRK14948 PRK14948
DNA polymerase III subunit gamma/tau;
456-538 1.12e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237862 [Multi-domain]  Cd Length: 620  Bit Score: 44.57  E-value: 1.12e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   456 TPAGPTTASSEPPTPAGPTTASSEPPTPagrPPTPAGRPPTPANPTASSEPPTPAGRPPTPAGRPPTPANPTASSEPPTP 535
Cdd:PRK14948  514 SGSASNTAKTPPPPQKSPPPPAPTPPLP---QPTATAPPPTPPPPPPTATQASSNAPAQIPADSSPPPPIPEEPTPSPTK 590

                  ...
gi 30984464   536 NPE 538
Cdd:PRK14948  591 DSS 593
DUF4045 pfam13254
Domain of unknown function (DUF4045); This presumed domain is functionally uncharacterized. ...
452-572 1.25e-03

Domain of unknown function (DUF4045); This presumed domain is functionally uncharacterized. This domain family is found in bacteria and eukaryotes, and is typically between 384 and 430 amino acids in length.


Pssm-ID: 433066 [Multi-domain]  Cd Length: 415  Bit Score: 44.39  E-value: 1.25e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464    452 SEPPTPAgPTTASSEPPTPAGPTTASSEPPTPAGRPPTPagrpPTPANPTASSEPPTPAGRpptpagRPPTPANPTASSE 531
Cdd:pfam13254  247 DKEQSPA-PTSASEPPPKTKELPKDSEEPAAPSKSAEAS----TEKKEPDTESSPETSSEK------SAPSLLSPVSKAS 315
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|.
gi 30984464    532 PPTPNPEGAPAPSSNEQPPAAASTDEATQkaldaLRDRQPP 572
Cdd:pfam13254  316 IDKPLSSPDRDPLSPKPKPQSPPKDFRAN-----LRSREVP 351
PTZ00144 PTZ00144
dihydrolipoamide succinyltransferase; Provisional
455-535 1.26e-03

dihydrolipoamide succinyltransferase; Provisional


Pssm-ID: 240289 [Multi-domain]  Cd Length: 418  Bit Score: 44.29  E-value: 1.26e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   455 PTPAGPTTASSEPPTPAGPTTASSEPPTPAGRPPTPAGRPPTPANPTASSEPPTPAGRPPTPAGRpptpANPTASSEPPT 534
Cdd:PTZ00144  120 TGGAPPAAAPAAAAAAKAEKTTPEKPKAAAPTPEPPAASKPTPPAAAKPPEPAPAAKPPPTPVAR----ADPRETRVPMS 195

                  .
gi 30984464   535 P 535
Cdd:PTZ00144  196 R 196
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
454-581 1.32e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 44.46  E-value: 1.32e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   454 PPTPAGPTTASSEPPTPAGPTTASSEPPTPAGRPPTPAGRPPTP----ANPTASSEPPTPAGRPPTPAGRPPTPANPTAS 529
Cdd:PRK07003  383 PGARAAAAVGASAVPAVTAVTGAAGAALAPKAAAAAAATRAEAPpaapAPPATADRGDDAADGDAPVPAKANARASADSR 462
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 30984464   530 SEPPTPNPEGAPAPSS---NEQPPAAASTDE--ATQKALDALRDRQPPEPPCGSLTE 581
Cdd:PRK07003  463 CDERDAQPPADSGSASapaSDAPPDAAFEPAprAAAPSAATPAAVPDARAPAAASRE 519
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
401-575 1.32e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 44.76  E-value: 1.32e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464    401 QTRPAAGPRPPTPASRPPTPGAPPTPGAPPTPGAPPTPGAPPTPAGPTTASSEPPTPAGPTTASSEPPTPAGPTTASSEP 480
Cdd:pfam03154  168 QTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPHP 247
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464    481 P-TPAGRPPTPAGRPPTPANPTASSEPPTPAGRP----------PTPAGRPPTPANPTASSEPPTPNPEGAPAPSSNEQP 549
Cdd:pfam03154  248 PlQPMTQPPPPSQVSPQPLPQPSLHGQMPPMPHSlqtgpshmqhPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIHT 327
                          170       180
                   ....*....|....*....|....*.
gi 30984464    550 PAAASTDEATQKAldalrdRQPPEPP 575
Cdd:pfam03154  328 PPSQSQLQSQQPP------REQPLPP 347
PTZ00436 PTZ00436
60S ribosomal protein L19-like protein; Provisional
458-565 1.44e-03

60S ribosomal protein L19-like protein; Provisional


Pssm-ID: 185616 [Multi-domain]  Cd Length: 357  Bit Score: 43.79  E-value: 1.44e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   458 AGPTTASSEPPTPAGPTTASSEPPTPAGRPPTPAGRPPTPAN--PTASSEPPTPAGRPPTPAGRPPTPAnptasSEPPTp 535
Cdd:PTZ00436  220 AAPAKAAAAPAKAAAPPAKAAAAPAKAAAAPAKAAAPPAKAAapPAKAAAPPAKAAAPPAKAAAPPAKA-----AAPPA- 293
                          90       100       110
                  ....*....|....*....|....*....|
gi 30984464   536 npEGAPAPSSNEQPPAAASTDEATQKALDA 565
Cdd:PTZ00436  294 --KAAAAPAKAAAAPAKAAAAPAKAAAPPA 321
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
340-572 1.51e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 44.37  E-value: 1.51e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464    340 PNQHYPLgfskRRRPTWTPPS--SLEDLSAGRHHPKRASLPTRTRRSARHAATPF-SRGSGGDEQTRPAAGPRPPTPASR 416
Cdd:pfam03154  243 PSPHPPL----QPMTQPPPPSqvSPQPLPQPSLHGQMPPMPHSLQTGPSHMQHPVpPQPFPLTPQSSQSQVPPGPSPAAP 318
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464    417 PPTPGAPPTPGAPPTPGAPPTPGAPPTPAGPTTASSEPPTPAGPTT------ASSEPPTPAGPT---TASSEPPTPAGRP 487
Cdd:pfam03154  319 GQSQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMPHIKPPPTTPIPqlpnpqSHKHPPHLSGPSpfqMNSNLPPPPALKP 398
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464    488 --------PTPAGRPPTPANPTASSEPPTPA---------GRPPTPAGRPPTPANPTASSEPPTPN----PEGAPAPSSN 546
Cdd:pfam03154  399 lsslsthhPPSAHPPPLQLMPQSQQLPPPPAqppvltqsqSLPPPAASHPPTSGLHQVPSQSPFPQhpfvPGGPPPITPP 478
                          250       260
                   ....*....|....*....|....*.
gi 30984464    547 EQPPAAASTdeatqkaldALRDRQPP 572
Cdd:pfam03154  479 SGPPTSTSS---------AMPGIQPP 495
EntF COG1020
EntF, seryl-AMP synthase component of non-ribosomal peptide synthetase [Secondary metabolites ...
734-1192 1.62e-03

EntF, seryl-AMP synthase component of non-ribosomal peptide synthetase [Secondary metabolites biosynthesis, transport and catabolism];


Pssm-ID: 440643 [Multi-domain]  Cd Length: 1329  Bit Score: 44.46  E-value: 1.62e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464  734 ELAELERRLRATPP-TEVYArlsewLLERSKAGPDTLFAPATPTHPEPLLQRIQALAGFARREEVRAAAEDREVRGALDA 812
Cdd:COG1020  884 ELGEIEAALLQHPGvREAVV-----VAREDAPGDKRLVAYVVPEAGAAAAAALLRLALALLLPPYMVPAAVVLLLPLPLT 958
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464  813 LARGVDAVARRSGPLTVAAVSPEEPGEgggrphplsPEAIRVRLEQLRADGQKAVEGATREYFHRGAVYSAKALLAGDAR 892
Cdd:COG1020  959 GNGKLDRLALPAPAAAAAAAAAAPPAE---------EEEEEAALALLLLLVVVVGDDDFFFFGGGLGLLLLLALARAARL 1029
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464  893 DRRYHVASAPVVPVVQLLESLPAFDAHVQEVARRARVPAPPPLATSPPAELLRELVQRGRDLEAPADLAAWLASLGDAAG 972
Cdd:COG1020 1030 LLLLLLLLLLFLAAAAAAAAAAAAAAAAAAAAPLAAAAAPLPLPPLLLSLLALLLALLLLLALLALLALLLLLLLLLLLL 1109
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464  973 QGLVVRKELDELAQAIYKINERTVRRSSGLAELERFEALDAALRGELESEAAFEPGGGDGAAAGGLPAETRRLAEDALHQ 1052
Cdd:COG1020 1110 ALLLLLALLLALLAALRARRAVRQEGPRLRLLVALAAALALAALLALLLAAAAAAAELLAAAALLLLLALLLLALLLLLL 1189
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464 1053 AKAMAAAKLTDELSPEARERLAARVRAIEAMLEEARARAEAAKAALARFFQKLQGVLRPLPDFGGLRVAPAVLATLRADI 1132
Cdd:COG1020 1190 LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLAAAAAALLALALLLALLALAALLALAALAALAAALLALALALLAL 1269
                        410       420       430       440       450       460
                 ....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464 1133 PGGWTCLPDAAQAAPPEVRAALRADLWGLLGQYRGALEHPTADTAAALSGLHPNFAEVLR 1192
Cdd:COG1020 1270 ALLLLALALLLPALARARAARTARALALLLLLALLLLLALALALLLLLLLLLALLLLALL 1329
PRK11633 PRK11633
cell division protein DedD; Provisional
454-567 1.62e-03

cell division protein DedD; Provisional


Pssm-ID: 236940 [Multi-domain]  Cd Length: 226  Bit Score: 42.68  E-value: 1.62e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   454 PPTPAGPTTassePPTPAGPTTASSEPPTPAGRPPTPAGRPPTPANPTASSEPPTPagrPPTPagrPPTPANPTASSEPP 533
Cdd:PRK11633   58 AATQALPTQ----PPEGAAEAVRAGDAAAPSLDPATVAPPNTPVEPEPAPVEPPKP---KPVE---KPKPKPKPQQKVEA 127
                          90       100       110
                  ....*....|....*....|....*....|....
gi 30984464   534 TPNPEGAPAPSSNEQPpaaASTDEATQKALDALR 567
Cdd:PRK11633  128 PPAPKPEPKPVVEEKA---APTGKAYVVQLGALK 158
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
451-564 1.64e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 44.37  E-value: 1.64e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464    451 SSEPPTPAGPTTASSEPPTPAGPTTASSEPP-TPAGRPPT------PAGRPPTPANPTASSEPPTPAGRPPTPAGRPPTP 523
Cdd:pfam03154  444 AASHPPTSGLHQVPSQSPFPQHPFVPGGPPPiTPPSGPPTstssamPGIQPPSSASVSSSGPVPAAVSCPLPPVQIKEEA 523
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|.
gi 30984464    524 ANPTASSEPPTPNPEGAPAPSSNEQPPAAASTDEATQKALD 564
Cdd:pfam03154  524 LDEAEEPESPPPPPRSPSPEPTVVNTPSHASQSARFYKHLD 564
PHA02682 PHA02682
ORF080 virion core protein; Provisional
455-568 1.66e-03

ORF080 virion core protein; Provisional


Pssm-ID: 177464 [Multi-domain]  Cd Length: 280  Bit Score: 43.31  E-value: 1.66e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   455 PTPA-GPTTASSEPPTPAGPTTASSEPPTpagrPPTPAGRPPTPANPTASsepPTPAGRPPTPAGRPPTPANPTASsepp 533
Cdd:PHA02682  112 PAPAcPPATAPTCPPPAVCPAPARPAPAC----PPSTRQCPPAPPLPTPK---PAPAAKPIFLHNQLPPPDYPAAS---- 180
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*.
gi 30984464   534 TPNPEGAPAPSSNEQP-------PAAASTDEATQKAL----DALRD 568
Cdd:PHA02682  181 CPTIETAPAASPVLEPripdkiiDADNDDKDLIKKELadiaDSVRD 226
PHA03255 PHA03255
BDLF3; Provisional
457-544 1.69e-03

BDLF3; Provisional


Pssm-ID: 165513 [Multi-domain]  Cd Length: 234  Bit Score: 42.97  E-value: 1.69e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   457 PAGPTTASSEPPTPAGPTTASSEPPTPAGrpptPAGRPPTPANPTASSEPPTPAGRPPTPAGR-PPTPANPTAS------ 529
Cdd:PHA03255   89 TPVPTTSNASTINVTTKVTAQNITATEAG----TGTSTGVTSNVTTRSSSTTSATTRITNATTlAPTLSSKGTSnatktt 164
                          90
                  ....*....|....*
gi 30984464   530 SEPPTPNPEGAPAPS 544
Cdd:PHA03255  165 AELPTVPDERQPSLS 179
PRK12495 PRK12495
hypothetical protein; Provisional
454-539 1.76e-03

hypothetical protein; Provisional


Pssm-ID: 183558 [Multi-domain]  Cd Length: 226  Bit Score: 42.93  E-value: 1.76e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   454 PPTPAGPTTASSEPPTPAGPTTASSEPPTPAGRPPTPAGRPPTPANPTASSEPPTPAGRPPtPAGRPPTPANPTASSEPP 533
Cdd:PRK12495   96 PDDDAQPAAEAEAADQSAPPEASSTSATDEAATDPPATAAARDGPTPDPTAQPATPDERRS-PRQRPPVSGEPPTPSTPD 174

                  ....*.
gi 30984464   534 TPNPEG 539
Cdd:PRK12495  175 AHVAGT 180
PTZ00144 PTZ00144
dihydrolipoamide succinyltransferase; Provisional
451-508 1.77e-03

dihydrolipoamide succinyltransferase; Provisional


Pssm-ID: 240289 [Multi-domain]  Cd Length: 418  Bit Score: 43.90  E-value: 1.77e-03
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 30984464   451 SSEPPTPAGPTTASSEPPTPAgpttASSEPPTPAGRPPTPAGRPPTPANPTASSEPPT 508
Cdd:PTZ00144  136 KAEKTTPEKPKAAAPTPEPPA----ASKPTPPAAAKPPEPAPAAKPPPTPVARADPRE 189
PHA00666 PHA00666
putative protease
461-564 1.83e-03

putative protease


Pssm-ID: 222808 [Multi-domain]  Cd Length: 233  Bit Score: 42.72  E-value: 1.83e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   461 TTASSEPPTPAGPTTASSEPPTPAGRPPTPAGRPPTPanptaSSEPPTPAGRPPTPAGRPPTPaNPTASSEPPTPNPEGA 540
Cdd:PHA00666   10 RRLCNEQPGDGGSQPAASEPAAGAGDNPAPQGDPTQE-----EGDKPQPAAGADKPEGDKKAD-GDKPEEKKPGEKPEGA 83
                          90       100
                  ....*....|....*....|....
gi 30984464   541 PApSSNEQPPAAASTDEATQKALD 564
Cdd:PHA00666   84 PE-KYEFQAAEGVELDTGALGAFE 106
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
455-543 1.85e-03

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 43.99  E-value: 1.85e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   455 PTPAGPTTA-SSEPPTPAGPTTASSEPPTPAGRPPTPAGRPPTPANPtassEPPTPAGRP----PTPAGRP-PTPANPTA 528
Cdd:NF033839  290 KKPSAPKPGmQPSPQPEKKEVKPEPETPKPEVKPQLEKPKPEVKPQP----EKPKPEVKPqletPKPEVKPqPEKPKPEV 365
                          90
                  ....*....|....*
gi 30984464   529 SSEPPTPNPEGAPAP 543
Cdd:NF033839  366 KPQPEKPKPEVKPQP 380
PRK10856 PRK10856
cytoskeleton protein RodZ;
451-533 1.87e-03

cytoskeleton protein RodZ;


Pssm-ID: 236776 [Multi-domain]  Cd Length: 331  Bit Score: 43.48  E-value: 1.87e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   451 SSEPPTPAGPTTASSEPPTPA-GPTTASSEPPTPAGRPPTPAGRPPTPAN-PTASSEPPTPAGRPPTPAGRPPTPANPTA 528
Cdd:PRK10856  167 STTTDPATTPAPAAPVDTTPTnSQTPAVATAPAPAVDPQQNAVVAPSQANvDTAATPAPAAPATPDGAAPLPTDQAGVST 246

                  ....*
gi 30984464   529 SSEPP 533
Cdd:PRK10856  247 PAADP 251
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
453-574 1.95e-03

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 43.99  E-value: 1.95e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   453 EPPTPAGPTTASSEPPTPagptTASSEPPTPAGRP-PTPAGR----PPTPANPTASSEP----PTPAGRP----PTPAGR 519
Cdd:NF033839  302 SPQPEKKEVKPEPETPKP----EVKPQLEKPKPEVkPQPEKPkpevKPQLETPKPEVKPqpekPKPEVKPqpekPKPEVK 377
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 30984464   520 P-PTPANPTASSEPPTPNPEGAPAPssnEQPPAAASTDEATQKAlDALRDRQPPEP 574
Cdd:NF033839  378 PqPETPKPEVKPQPEKPKPEVKPQP---EKPKPEVKPQPEKPKP-EVKPQPEKPKP 429
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
451-550 1.97e-03

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 43.87  E-value: 1.97e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464    451 SSEPPTPAGPTTASSEPPTPAGPTTASSEPPTPAGRPPTPAGRPPTPANPTASSEPPTPAGRPPTPAGRPPTPANPTASS 530
Cdd:pfam09770  207 AKKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQQPQQQPQQPQQHPGQGHPVTILQRPQSPQPDPAQPSIQPQAQ 286
                           90       100
                   ....*....|....*....|
gi 30984464    531 EPPTPNPEGAPAPSSNEQPP 550
Cdd:pfam09770  287 QFHQQPPPVPVQPTQILQNP 306
MukB COG3096
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell ...
1649-1962 1.97e-03

Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 442330 [Multi-domain]  Cd Length: 1470  Bit Score: 44.17  E-value: 1.97e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464 1649 TAY--AEHVAFVARRD-----LGEAKDALVRAKQQRAEATDRVTAALREALAAHERQARSEAE------SLANLKTLLR- 1714
Cdd:COG3096  266 TNYvaADYMRHANERRelserALELRRELFGARRQLAEEQYRLVEMARELEELSARESDLEQDyqaasdHLNLVQTALRq 345
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464 1715 -------VAAIPAAAAKTLEQARSVAEIVDQIELLLEQTEKAAE---------------LD---------QAAVDWLEHA 1763
Cdd:COG3096  346 qekieryQEDLEELTERLEEQEEVVEEAAEQLAEAEARLEAAEEevdslksqladyqqaLDvqqtraiqyQQAVQALEKA 425
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464 1764 RRVFEAHPLTAarDGSPDPLARLHARLDALGETRRRTAALRRSLEAAEAEWDEVWARFGRARGGAWKSpEALGAAREQLR 1843
Cdd:COG3096  426 RALCGLPDLTP--ENAEDYLAAFRAKEQQATEEVLELEQKLSVADAARRQFEKAYELVCKIAGEVERS-QAWQTARELLR 502
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464 1844 alqtatntvlglvaDAHYPRLPAKYQGAIGAKSAERAGAVEELGVAVERHDGLLARLREDVVArvpwemnADALGRLLAE 1923
Cdd:COG3096  503 --------------RYRSQQALAQRLQQLRAQLAELEQRLRQQQNAERLLEEFCQRIGQQLDA-------AEELEELLAE 561
                        330       340       350
                 ....*....|....*....|....*....|....*....
gi 30984464 1924 FDALAEDLTPWAVDEFRGARALVQHRLGLYSAYAKARAQ 1962
Cdd:COG3096  562 LEAQLEELEEQAAEAVEQRSELRQQLEQLRARIKELAAR 600
PHA03381 PHA03381
tegument protein VP22; Provisional
452-554 2.28e-03

tegument protein VP22; Provisional


Pssm-ID: 177618 [Multi-domain]  Cd Length: 290  Bit Score: 43.08  E-value: 2.28e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   452 SEPPTPAGPTTASSEPPTPAGPTTASSEPPTPaGRPPTPAGRPPTPANPTASSEPPTPAGRPPTPAGRPPTPANPTASSE 531
Cdd:PHA03381   69 ADYPYYTGSSSEDERPADPRPSRRPHAQPEAS-GPGPARGARGPAGSRGRGRRAESPSPRDPPNPKGASAPRGRKSACAD 147
                          90       100
                  ....*....|....*....|...
gi 30984464   532 PPTPNPEGAPAPSSNEQPPAAAS 554
Cdd:PHA03381  148 SAALLDAPAPAAPKRQKTPAGLA 170
KREPA2 cd23959
Kinetoplastid RNA Editing Protein A2 (KREPA2); The KREPA2 (TbMP63) protein is a component of ...
453-576 2.42e-03

Kinetoplastid RNA Editing Protein A2 (KREPA2); The KREPA2 (TbMP63) protein is a component of the parasitic protozoan's KREPA RNA editing catalytic complex (RECC). Kinetoplastid RNA editing (KRE) proteins occur as pairs or sets of related proteins in multiple complexes. The KREPA complex is composed of six components (KREPA1-6), which share a conserved C-terminal region containing an oligonucleotide-binding (OB)-fold-like domain. KREPAs are responsible for the site-specific insertion and deletion of U nucleotides in kinetoplastid mitochondrial pre-messenger RNA. Apart from the conserved C-terminal OB-fold domain, KREPA1, KREPA2, and KREPA3 contain two conserved C2H2 zinc-finger domains. KREPA2 and kinetoplastid RNA editing ligase 1 (KREL1) are specific for ligation post-U-deletion and are paralogous to KREL2 and KREPA1, which are specific for ligation post-U-insertion. KREPA2 is critical for RECC stability and KREL1 integration into the complex.


Pssm-ID: 467780 [Multi-domain]  Cd Length: 424  Bit Score: 43.32  E-value: 2.42e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464  453 EPPTPAGPTTASSEPPTPagPTTASSEPPTPaGRPPTPAGRPPTPANPTASSEPPTPAGRPPTPAGRPPTPANPTASSEP 532
Cdd:cd23959  128 ETHKTAQVAPPKAEPQTA--PVTPFGQLPMF-GQHPPPAKPLPAAAAAQQSSASPGEVASPFASGTVSASPFATATDTAP 204
                         90       100       110       120
                 ....*....|....*....|....*....|....*....|....
gi 30984464  533 PTPNPEGAPAPSSNEQPPAAASTDEATQKALDALRDRQPPEPPC 576
Cdd:cd23959  205 SSGAPDGFPAEASAPSPFAAPASAASFPAAPVANGEAATPTHAC 248
PRK08691 PRK08691
DNA polymerase III subunits gamma and tau; Validated
451-577 2.54e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236333 [Multi-domain]  Cd Length: 709  Bit Score: 43.54  E-value: 2.54e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   451 SSEPPTPAGPTTASSEPPTPAGPT------TASSEPPTPAGRPPTPAGR------PPTPANPtasSEPPTPAGRPPTPA- 517
Cdd:PRK08691  379 SPSAQTAEKETAAKKPQPRPEAETaqtpvqTASAAAMPSEGKTAGPVSNqenndvPPWEDAP---DEAQTAAGTAQTSAk 455
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 30984464   518 -----GRPPTPANPTASSEPPTPNPEGAP-APSSNEQPPAAASTDEATQKALDALRDRQPPEPPCG 577
Cdd:PRK08691  456 siqtaSEAETPPENQVSKNKAADNETDAPlSEVPSENPIQATPNDEAVETETFAHEAPAEPFYGYG 521
rad23 TIGR00601
UV excision repair protein Rad23; All proteins in this family for which functions are known ...
474-546 2.74e-03

UV excision repair protein Rad23; All proteins in this family for which functions are known are components of a multiprotein complex used for targeting nucleotide excision repair to specific parts of the genome. In humans, Rad23 complexes with the XPC protein. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 273167 [Multi-domain]  Cd Length: 378  Bit Score: 42.96  E-value: 2.74e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 30984464    474 TTASSEPPTPAGRPPTPAgrPPTPANPTASSEPPTPAGRPPtpaGRPPTPANPTASSEPPTPN-PEGAPAPSSN 546
Cdd:TIGR00601   81 TGKVAPPAATPTSAPTPT--PSPPASPASGMSAAPASAVEE---KSPSEESATATAPESPSTSvPSSGSDAAST 149
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
351-558 2.75e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 43.62  E-value: 2.75e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   351 RRRPTWTPPSSLEDLSAGRHHPKRASLPTRTRRSARHAATPFSRGSGGDEQTRPAAGPRPPTPASRPPTPGAPPTPGAPP 430
Cdd:PHA03307   64 RFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGS 143
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   431 TPGAPPTPGAPPTPAGPTTASSEPPTPAGPTTASSEPPTPAGPTTASSEPP--TPAGRPPTPAGRPPTPANPTASSEPPT 508
Cdd:PHA03307  144 PGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEETARAPSSPPAEPPpsTPPAAASPRPPRRSSPISASASSPAPA 223
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 30984464   509 PA-------------------------GRPPTPAGRPPTPANPTASSEPPTPNPEGAPAPSSNEQPPAAASTDEA 558
Cdd:PHA03307  224 PGrsaaddagasssdssssessgcgwgPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSP 298
PRK12438 PRK12438
hypothetical protein; Provisional
497-570 2.81e-03

hypothetical protein; Provisional


Pssm-ID: 171499 [Multi-domain]  Cd Length: 991  Bit Score: 43.70  E-value: 2.81e-03
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 30984464   497 PANPTASSEPPTPAGRPPTPAGRPPTPANPTASSEPPTPNpegAPAPSSNEQPPAAAStdeATQKALDALRDRQ 570
Cdd:PRK12438  897 PGTGRVATAPGGDAASAPPPGAGPPAPPQAVPPPRTTQPP---AAPPRGPDVPPAAVA---ELRETLADLRSAQ 964
PRK08691 PRK08691
DNA polymerase III subunits gamma and tau; Validated
460-627 2.84e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236333 [Multi-domain]  Cd Length: 709  Bit Score: 43.54  E-value: 2.84e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   460 PTTASSEPPTPAGPTTASSEPPTPAGRPPTPAGRP-PTPANPTASSEPPTPAGRPPTPAGRPPTPANPTASSEppTPNPE 538
Cdd:PRK08691  360 PLAAASCDANAVIENTELQSPSAQTAEKETAAKKPqPRPEAETAQTPVQTASAAAMPSEGKTAGPVSNQENND--VPPWE 437
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   539 GAPAPSSNEQPPAAASTdEATQKAldalrdRQPPEPPCGSLTELLGRHPDTDGGVSRLAAHEAGIAREVTECSRLTINAL 618
Cdd:PRK08691  438 DAPDEAQTAAGTAQTSA-KSIQTA------SEAETPPENQVSKNKAADNETDAPLSEVPSENPIQATPNDEAVETETFAH 510

                  ....*....
gi 30984464   619 RSPFPGSPG 627
Cdd:PRK08691  511 EAPAEPFYG 519
SAV_2336_NTERM NF041121
SAV_2336 family N-terminal domain; This HMM describes an N-terminal domain shared by SAV_2336 ...
494-575 2.97e-03

SAV_2336 family N-terminal domain; This HMM describes an N-terminal domain shared by SAV_2336 (BAC70047.1), whose C-terminal region suggests restriction enzyme activity (PMID: 18456708), and by other proteins with unrelated C-terminal regions. A member protein was also identified in a kanamycin biosynthetic gene cluster (PMID: 16766657), while the N-terminal regions of two other member proteins were named Trypco1 in a bioinformatic study (PMID: 32101166) of predicted bacterial conflict systems.


Pssm-ID: 469044 [Multi-domain]  Cd Length: 473  Bit Score: 43.07  E-value: 2.97e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   494 PPTPANPTASSEPPTPAGRPPTPAGRPPTPANPTASSEPPTPNPEGAPAPSSNEQPPAAASTDEATQKALDALRDRQPPE 573
Cdd:NF041121   16 GRAAAPPSPEGPAPTAASQPATPPPPAAPPSPPGDPPEPPAPEPAPLPAPYPGSLAPPPPPPPGPAGAAPGAALPVRVPA 95

                  ..
gi 30984464   574 PP 575
Cdd:NF041121   96 PP 97
PRK14959 PRK14959
DNA polymerase III subunits gamma and tau; Provisional
455-549 2.99e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 184923 [Multi-domain]  Cd Length: 624  Bit Score: 43.13  E-value: 2.99e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   455 PTPAGPTTASSEPPTPAGPTTASSEPPTPAGRPPTPAGRPPTPANPTASSEPPTPAGRPPTPAGRPPTPANPTASSE-PP 533
Cdd:PRK14959  398 PTPGTQGPQGTAPAAGMTPSSAAPATPAPSAAPSPRVPWDDAPPAPPRSGIPPRPAPRMPEASPVPGAPDSVASASDaPP 477
                          90
                  ....*....|....*.
gi 30984464   534 TPNPEGAPAPSSNEQP 549
Cdd:PRK14959  478 TLGDPSDTAEHTPSGP 493
PHA03377 PHA03377
EBNA-3C; Provisional
455-574 3.05e-03

EBNA-3C; Provisional


Pssm-ID: 177614 [Multi-domain]  Cd Length: 1000  Bit Score: 43.50  E-value: 3.05e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   455 PTPAGPTTASSEPPTPAGPTTASSEPPTPAGR------------------------------------------------ 486
Cdd:PHA03377  468 LTPVEHTTVILHQPPQSPPTVAIKPAPPPSRRrrgacvvydddiievidvetteeeesvtqpakphrkvqdgfqrsgrrq 547
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   487 ----PPT--PAGRPPTPANPTASSEPPTPAGRPPTPAGRPPTPANPTASSEPPTPNPEGAPAPSSNE-QPPAAASTDEAT 559
Cdd:PHA03377  548 kratPPKvsPSDRGPPKASPPVMAPPSTGPRVMATPSTGPRDMAPPSTGPRQQAKCKDGPPASGPHEkQPPSSAPRDMAP 627
                         170
                  ....*....|....*
gi 30984464   560 QKALDALRDRQPPEP 574
Cdd:PHA03377  628 SVVRMFLRERLLEQS 642
PHA03264 PHA03264
envelope glycoprotein D; Provisional
480-554 3.07e-03

envelope glycoprotein D; Provisional


Pssm-ID: 223029 [Multi-domain]  Cd Length: 416  Bit Score: 43.07  E-value: 3.07e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   480 PPTPAGRPPTPAGRPPTPANPTASSEPPTPAGRPPTPAGRPPTPANPTASSE-----------PPTPNPEGAPA-PSSNE 547
Cdd:PHA03264  267 PPAPSGGSPAPPGDDRPEAKPEPGPVEDGAPGRETGGEGEGPEPAGRDGAAGgepkpgpprpaPDADRPEGWPSlEAITF 346

                  ....*..
gi 30984464   548 QPPAAAS 554
Cdd:PHA03264  347 PPPTPAT 353
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
390-595 3.19e-03

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 43.22  E-value: 3.19e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   390 TPFSRGSGGDEQTRPAAGPRPPTPASRPPTPGAPPTPGAPPTPGAPPTPGAPPTPAGPTTASSEPPTPAgpTTASSEPPT 469
Cdd:NF033839  283 TPKEPGNKKPSAPKPGMQPSPQPEKKEVKPEPETPKPEVKPQLEKPKPEVKPQPEKPKPEVKPQLETPK--PEVKPQPEK 360
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   470 PAGPTTASSEPPTPAGRP----PTPAGRP-PTPANPTASSEP--PTPAGRP----PTPAGRP-PTPANPTASSEPPTPNP 537
Cdd:NF033839  361 PKPEVKPQPEKPKPEVKPqpetPKPEVKPqPEKPKPEVKPQPekPKPEVKPqpekPKPEVKPqPEKPKPEVKPQPEKPKP 440
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 30984464   538 EGAPAPSSN--EQPPAAASTDEATQKALDALRDRQPPEP----PCGSLTELLGRHPDTDGGVSR 595
Cdd:NF033839  441 EVKPQPEKPkpEVKPQPETPKPEVKPQPEKPKPEVKPQPekpkPDNSKPQADDKKPSTPNNLSK 504
rad23 TIGR00601
UV excision repair protein Rad23; All proteins in this family for which functions are known ...
451-509 3.31e-03

UV excision repair protein Rad23; All proteins in this family for which functions are known are components of a multiprotein complex used for targeting nucleotide excision repair to specific parts of the genome. In humans, Rad23 complexes with the XPC protein. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 273167 [Multi-domain]  Cd Length: 378  Bit Score: 42.96  E-value: 3.31e-03
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 30984464    451 SSEPPTPAGPTTASSEPPTPAGPTTASSEPPTPAGRPPTPAGRP---PTPANPTASSEPPTP 509
Cdd:TIGR00601   83 KVAPPAATPTSAPTPTPSPPASPASGMSAAPASAVEEKSPSEESataTAPESPSTSVPSSGS 144
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
1565-1962 3.75e-03

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 42.83  E-value: 3.75e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464 1565 RRGHAEYLELCDRLDALRADVHRALggvplDLAAAAEQTVRLRGDPAAAAELVRTGVTLacpsEDALAACVGALERVDQa 1644
Cdd:COG4717   84 EEKEEEYAELQEELEELEEELEELE-----AELEELREELEKLEKLLQLLPLYQELEAL----EAELAELPERLEELEE- 153
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464 1645 pvKDTAYAEhvafvARRDLGEAKDALVRAKQQRAEATDRVTAALREALAAHERQARSEAESLANLKTLLRVAAipaaaak 1724
Cdd:COG4717  154 --RLEELRE-----LEEELEELEAELAELQEELEELLEQLSLATEEELQDLAEELEELQQRLAELEEELEEAQ------- 219
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464 1725 tlEQARSVAEIVDQIELLLEQTEKAAELDQAAVDWL-------------EHARRVFEAHPLTAARDGSPDPLARLHARLD 1791
Cdd:COG4717  220 --EELEELEEELEQLENELEAAALEERLKEARLLLLiaaallallglggSLLSLILTIAGVLFLVLGLLALLFLLLAREK 297
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464 1792 ALGETRRRTAALRRSLEA-AEAEWDEVWARFGRARGgawKSPEALGAAREQLRALQTATNTVLGLVADAHYPRLPAKYQ- 1869
Cdd:COG4717  298 ASLGKEAEELQALPALEElEEEELEELLAALGLPPD---LSPEELLELLDRIEELQELLREAEELEEELQLEELEQEIAa 374
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464 1870 --GAIGAKSAE----RAGAVEELGVAVERHDGLLARLRE------DVVARVPWEMNADALGRLLAEFDALAEDLtpwavD 1937
Cdd:COG4717  375 llAEAGVEDEEelraALEQAEEYQELKEELEELEEQLEEllgeleELLEALDEEELEEELEELEEELEELEEEL-----E 449
                        410       420
                 ....*....|....*....|....*...
gi 30984464 1938 EFRGARALVQHRLGLYS---AYAKARAQ 1962
Cdd:COG4717  450 ELREELAELEAELEQLEedgELAELLQE 477
PLN02983 PLN02983
biotin carboxyl carrier protein of acetyl-CoA carboxylase
453-521 3.77e-03

biotin carboxyl carrier protein of acetyl-CoA carboxylase


Pssm-ID: 215533 [Multi-domain]  Cd Length: 274  Bit Score: 42.13  E-value: 3.77e-03
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 30984464   453 EPPTPAGPTTASSEPPTPAGPTTASSEPPTPAGRPPTPAGRPPTPANPTA--SSEPP--TP-AG---RPPTPaGRPP 521
Cdd:PLN02983  143 QPPPPAPVVMMQPPPPHAMPPASPPAAQPAPSAPASSPPPTPASPPPAKApkSSHPPlkSPmAGtfyRSPAP-GEPP 218
PHA02682 PHA02682
ORF080 virion core protein; Provisional
493-575 3.95e-03

ORF080 virion core protein; Provisional


Pssm-ID: 177464 [Multi-domain]  Cd Length: 280  Bit Score: 42.16  E-value: 3.95e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   493 RPPTPANPTAssepPTPAGRPPTPAGRPPTPANPTASSEPPTPNPEGAPAPSSNEQPPAAASTDEATQKALDALRDRQPP 572
Cdd:PHA02682   74 QRPSGQSPLA----PSPACAAPAPACPACAPAAPAPAVTCPAPAPACPPATAPTCPPPAVCPAPARPAPACPPSTRQCPP 149

                  ...
gi 30984464   573 EPP 575
Cdd:PHA02682  150 APP 152
rad23 TIGR00601
UV excision repair protein Rad23; All proteins in this family for which functions are known ...
461-530 4.01e-03

UV excision repair protein Rad23; All proteins in this family for which functions are known are components of a multiprotein complex used for targeting nucleotide excision repair to specific parts of the genome. In humans, Rad23 complexes with the XPC protein. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 273167 [Multi-domain]  Cd Length: 378  Bit Score: 42.57  E-value: 4.01e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464    461 TTASSEPPTPAGPTTASSEPPTPAGRPPTPAGRPPTPANPTASSEPPTPAGRPPTPAGRPPTPANPTASS 530
Cdd:TIGR00601   80 GTGKVAPPAATPTSAPTPTPSPPASPASGMSAAPASAVEEKSPSEESATATAPESPSTSVPSSGSDAAST 149
PRK12373 PRK12373
NADH-quinone oxidoreductase subunit E;
460-569 4.05e-03

NADH-quinone oxidoreductase subunit E;


Pssm-ID: 237082 [Multi-domain]  Cd Length: 400  Bit Score: 42.48  E-value: 4.05e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   460 PTTASSEPPTPAGPTTA---SSEPPTPAGRPPTPAGRPPTPANPTASSEPPT-PAGRPPTPAGRPPTpANPTASSEPPTP 535
Cdd:PRK12373  205 RYNASKALAEDIGDTVKridGTEVPLLAPWQGDAAPVPPSEAARPKSADAETnAALKTPATAPKAAA-KNAKAPEAQPVS 283
                          90       100       110
                  ....*....|....*....|....*....|....*
gi 30984464   536 NPEGA-PAPSSNEQPPAAASTDEATQKALDALRDR 569
Cdd:PRK12373  284 GTAAAePAPKEAAKAAAAAAKPALEDKPRPLGIAR 318
PRK14948 PRK14948
DNA polymerase III subunit gamma/tau;
457-537 4.29e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237862 [Multi-domain]  Cd Length: 620  Bit Score: 42.64  E-value: 4.29e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   457 PAGPTTASSEPPTPAGPTTASSEPPTPAGRPPTPagrpPTPANPTASSEPPTPAGRPPTPAGRPPTPANPTASSEPPTPN 536
Cdd:PRK14948  361 PSAFISEIANASAPANPTPAPNPSPPPAPIQPSA----PKTKQAATTPSPPPAKASPPIPVPAEPTEPSPTPPANAANAP 436

                  .
gi 30984464   537 P 537
Cdd:PRK14948  437 P 437
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
451-537 4.70e-03

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 42.43  E-value: 4.70e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464  451 SSEPPTPAGPTTASSEPPTPAGPTTASSEPPTPAGRPPTPAGRPPTPANPTASSEPPTPAGRPPTPAGRPPTPANPTASS 530
Cdd:COG3469  124 STTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSASTTPSATTTATATTASGATTPSATTTAT 203

                 ....*..
gi 30984464  531 EPPTPNP 537
Cdd:COG3469  204 TTGPPTP 210
PLN03209 PLN03209
translocon at the inner envelope of chloroplast subunit 62; Provisional
409-560 5.02e-03

translocon at the inner envelope of chloroplast subunit 62; Provisional


Pssm-ID: 178748 [Multi-domain]  Cd Length: 576  Bit Score: 42.61  E-value: 5.02e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   409 RPPTPASrpptpGAPPTPGAPPTPGAPPTPGAPPTPAGPTTASSEPPTPAGPTTASSE--PPTPAGPTTASSEPPTPagr 486
Cdd:PLN03209  327 RVPPKES-----DAADGPKPVPTKPVTPEAPSPPIEEEPPQPKAVVPRPLSPYTAYEDlkPPTSPIPTPPSSSPASS--- 398
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   487 PPTPAGRPPTPANPTASSEP------------PTPAGRPPTPAGR-----PPTPANPTASSEPPTPNPEGAPAPSSNEQP 549
Cdd:PLN03209  399 KSVDAVAKPAEPDVVPSPGSasnvpevepaqvEAKKTRPLSPYARyedlkPPTSPSPTAPTGVSPSVSSTSSVPAVPDTA 478
                         170
                  ....*....|.
gi 30984464   550 PAAASTDEATQ 560
Cdd:PLN03209  479 PATAATDAAAP 489
TALPID3 pfam15324
Hedgehog signalling target; TALPID3 is a family of eukaryotic proteins that are targets for ...
455-557 5.30e-03

Hedgehog signalling target; TALPID3 is a family of eukaryotic proteins that are targets for Hedgehog signalling. Mutations in this gene, first noticed in chickens, lead to multiple developmental abnormalities.


Pssm-ID: 434634 [Multi-domain]  Cd Length: 1288  Bit Score: 42.57  E-value: 5.30e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464    455 PTPAgPTTASSEPPTPAGPTTASSEPPTPAGRP-PTPAGRPPTPANPTASSEPPTPAGRP-PTPAGRPPTPANPTASSEP 532
Cdd:pfam15324  985 PTPV-PTPQPTPPCSPPSPLKEPSPVKTPDSSPcVSEHDFFPVKEIPPEKGADTGPAVSLvITPTVTPIATPPPAATPTP 1063
                           90       100
                   ....*....|....*....|....*
gi 30984464    533 PTPNPEGAPAPSSNEQPPAAASTDE 557
Cdd:pfam15324 1064 PLSENSIDKLKSPSPELPKPWEDSD 1088
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
1653-2012 5.39e-03

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 42.62  E-value: 5.39e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464 1653 EHVAFVARRDLGEAKDALVRAKQQRAEAT---DRVTAALREALAAHERQARSEAESLANLKTllrvaaiPAAAAKTLEQA 1729
Cdd:COG1196  226 EAELLLLKLRELEAELEELEAELEELEAEleeLEAELAELEAELEELRLELEELELELEEAQ-------AEEYELLAELA 298
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464 1730 RSVAEIVDQIELLLEQTEKAAELDQAAVDWLEHARRVFEAHPLTAARdgspdpLARLHARLDALGETRRRTAALRRSLEA 1809
Cdd:COG1196  299 RLEQDIARLEERRRELEERLEELEEELAELEEELEELEEELEELEEE------LEEAEEELEEAEAELAEAEEALLEAEA 372
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464 1810 AEAEWDEVWARfgrarggawkspealgAAREQLRALQTAtntvlglvadahyprlpAKYQGAIGAKSAERAGAVEELGVA 1889
Cdd:COG1196  373 ELAEAEEELEE----------------LAEELLEALRAA-----------------AELAAQLEELEEAEEALLERLERL 419
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464 1890 VERHDGLLARLREDVVARVPWEMNADALGRLLAEFDALAEDLTPWAVDEFRGARALVQHRLGLYSAYAKARAQTGAGGTP 1969
Cdd:COG1196  420 EEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLEELAEAAARLLLLLEA 499
                        330       340       350       360
                 ....*....|....*....|....*....|....*....|....
gi 30984464 1970 PPAPAPLLVDVRALEARARSPG-ERHEPDPRTVRGRGEAYLRAR 2012
Cdd:COG1196  500 EADYEGFLEGVKAALLLAGLRGlAGAVAVLIGVEAAYEAALEAA 543
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
400-603 5.46e-03

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 42.45  E-value: 5.46e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   400 EQTRPAAGPRPPTPASRPPTPGAPPTPGAPPTPGAPPTPGAPPTPAGPTTASSEPPTPAGPTTASSEPPTPagptTASSE 479
Cdd:NF033839  359 EKPKPEVKPQPEKPKPEVKPQPETPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKP----EVKPQ 434
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   480 PPTPAgrpPTPAGRPPTPaNPTASSEPPTPAgrpPTPAGRPPTPaNPTASSEPPTPNPEGAPAPSSNEQPPAAASTDEAT 559
Cdd:NF033839  435 PEKPK---PEVKPQPEKP-KPEVKPQPETPK---PEVKPQPEKP-KPEVKPQPEKPKPDNSKPQADDKKPSTPNNLSKDK 506
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....
gi 30984464   560 QKALDALRDRQPPEPPCGSLtellgrhPDTdGGVSRLAAHEAGI 603
Cdd:NF033839  507 QPSNQASTNEKATNKPKKSL-------PST-GSISNLALEIAGL 542
PRK14948 PRK14948
DNA polymerase III subunit gamma/tau;
452-530 5.64e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237862 [Multi-domain]  Cd Length: 620  Bit Score: 42.26  E-value: 5.64e-03
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 30984464   452 SEPPTPAGPTTASSEPPTPAGPTTASSEPPTPAGRPPTPAGRPPTPANPTASSEPPTPAGRPPTPAGRPPTPANPTASS 530
Cdd:PRK14948  361 PSAFISEIANASAPANPTPAPNPSPPPAPIQPSAPKTKQAATTPSPPPAKASPPIPVPAEPTEPSPTPPANAANAPPSL 439
PRK10672 PRK10672
endolytic peptidoglycan transglycosylase RlpA;
456-559 5.68e-03

endolytic peptidoglycan transglycosylase RlpA;


Pssm-ID: 236733 [Multi-domain]  Cd Length: 361  Bit Score: 41.97  E-value: 5.68e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   456 TPAGPTTASSEPPTPAGPTTASSEPPTPagrPPTPAGRPPTpaNPTASSEPPTPAGRPPTPAGRPPTPANP--TASSEPP 533
Cdd:PRK10672  186 TVAKQSYALPARPDLSGGMGTPSVQPAP---APQGDVLPVS--NSTLKSEDPTGAPVTSSGFLGAPTTLAPgvLEGSEPT 260
                          90       100
                  ....*....|....*....|....*.
gi 30984464   534 TPNPEGAPAPSSNEQPPAAASTDEAT 559
Cdd:PRK10672  261 PTAPSSAPATAPAAAAPQAAATSSSA 286
PRK14965 PRK14965
DNA polymerase III subunits gamma and tau; Provisional
491-568 5.75e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237871 [Multi-domain]  Cd Length: 576  Bit Score: 42.42  E-value: 5.75e-03
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 30984464   491 AGRPPTPANPTASSEPPTPAGRPPTPAGRPPTPANPTA-SSEPPTPNPEGAPAPSsneqPPAAASTDEATQKALDALRD 568
Cdd:PRK14965  378 ERGAPAPPSAAWGAPTPAAPAAPPPAAAPPVPPAAPARpAAARPAPAPAPPAAAA----PPARSADPAAAASAGDRWRA 452
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
1661-1880 5.87e-03

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 42.59  E-value: 5.87e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464 1661 RDLGEAKDALVRAKQQRaeatdrvtAALREALAAHER--QARSEAESLANLKTLLRVaaipaaaaktLEQARSVAEIVDQ 1738
Cdd:COG4913  235 DDLERAHEALEDAREQI--------ELLEPIRELAERyaAARERLAELEYLRAALRL----------WFAQRRLELLEAE 296
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464 1739 IELLLEQTEKA-AELDQAAVDWLEHARRVFEAHpltAARDGSP-DPLARLHARLDALGEtrrrtaalrrSLEAAEAEWDE 1816
Cdd:COG4913  297 LEELRAELARLeAELERLEARLDALREELDELE---AQIRGNGgDRLEQLEREIERLER----------ELEERERRRAR 363
                        170       180       190       200       210       220
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 30984464 1817 VWARFGRARGGAWKSPEALGAAREQLRALQTATNTVLGLVADAHYPRLPAKYQG--AIGAKSAERA 1880
Cdd:COG4913  364 LEALLAALGLPLPASAEEFAALRAEAAALLEALEEELEALEEALAEAEAALRDLrrELRELEAEIA 429
PHA03418 PHA03418
hypothetical E4 protein; Provisional
495-582 5.97e-03

hypothetical E4 protein; Provisional


Pssm-ID: 177646 [Multi-domain]  Cd Length: 230  Bit Score: 41.26  E-value: 5.97e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   495 PTPANPTASSEPPTPAGRPPTPAGRPPTPANPTASSEPPTPNPEGAPAPSSNEQPPAAASTDEATQKALDALRDRQPPEP 574
Cdd:PHA03418   37 PAPHHPNPQEDPDKNPSPPPDPPLTPRPPAQPNGHNKPPVTKQPGGEGTEEDHQAPLAADADDDPRPGKRSKADEHGPAP 116

                  ....*...
gi 30984464   575 PCGSLTEL 582
Cdd:PHA03418  117 GRAALAPF 124
PTZ00144 PTZ00144
dihydrolipoamide succinyltransferase; Provisional
488-570 6.06e-03

dihydrolipoamide succinyltransferase; Provisional


Pssm-ID: 240289 [Multi-domain]  Cd Length: 418  Bit Score: 41.98  E-value: 6.06e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   488 PTPAGRPPTPANPTASSEPPTPAGRPPTPAGRPPTPANPTASSEPPTPNPEGAPAPSSneqPPAAASTDEATQKALDALR 567
Cdd:PTZ00144  120 TGGAPPAAAPAAAAAAKAEKTTPEKPKAAAPTPEPPAASKPTPPAAAKPPEPAPAAKP---PPTPVARADPRETRVPMSR 196

                  ...
gi 30984464   568 DRQ 570
Cdd:PTZ00144  197 MRQ 199
PRK14965 PRK14965
DNA polymerase III subunits gamma and tau; Provisional
455-519 6.21e-03

Pssm-ID: 237871 [Multi-domain]  Cd Length: 576  Bit Score: 42.04  E-value: 6.21e-03
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 30984464   455 PTPAGPTTASSEPPTPAGPTTASSEPPTPAgRPPTPAGRPPTPANPTASSEPPTPAGRPPTPAGR 519
Cdd:PRK14965  382 PAPPSAAWGAPTPAAPAAPPPAAAPPVPPA-APARPAAARPAPAPAPPAAAAPPARSADPAAAAS 445
PRK14948 PRK14948
DNA polymerase III subunit gamma/tau;
488-561 6.57e-03

Pssm-ID: 237862 [Multi-domain]  Cd Length: 620  Bit Score: 42.26  E-value: 6.57e-03
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 30984464   488 PTPAGRPPTPANPTASSEPPTPAGRPPTPAgRPPTPANPTASSEPPTPNPEGAPAPSSNEQPPAAASTDEATQK 561
Cdd:PRK14948  518 SNTAKTPPPPQKSPPPPAPTPPLPQPTATA-PPPTPPPPPPTATQASSNAPAQIPADSSPPPPIPEEPTPSPTK 590
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
370-594 6.91e-03

Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 41.87  E-value: 6.91e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464    370 HHPKRASLPTRTRR-------SARHAATPFSRGSGGDEQTRPAAGPRPPTPASRPPTPGAPPTPGAPPTPGAPPTPGAPP 442
Cdd:pfam17823   90 HTPHGTDLSEPATRegaadgaASRALAAAASSSPSSAAQSLPAAIAALPSEAFSAPRAAACRANASAAPRAAIAAASAPH 169
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464    443 TPAGPTTASSEPPTPAGPTTASSEPPTPAGPTTASSepPTPAGRPPTPAGRPPTPANPTASSEPP--TPAGRPPTPAGRP 520
Cdd:pfam17823  170 AASPAPRTAASSTTAASSTTAASSAPTTAASSAPAT--LTPARGISTAATATGHPAAGTALAAVGnsSPAAGTVTAAVGT 247
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464    521 PTPAN-----------PTASSEPPTPNPEG---APA---PSSNEQPPAAASTDEATQKALDALRDRQP-----PEP-PCG 577
Cdd:pfam17823  248 VTPAAlatlaaaagtvASAAGTINMGDPHArrlSPAkhmPSDTMARNPAAPMGAQAQGPIIQVSTDQPvhntaGEPtPSP 327
                          250
                   ....*....|....*..
gi 30984464    578 SLTELLGRHPDTDGGVS 594
Cdd:pfam17823  328 SNTTLEPNTPKSVASTN 344
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
485-574 7.09e-03

Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 42.28  E-value: 7.09e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   485 GRPPTPAGRPPTPANPTASSEPPTPAGRPPTPAgrpptPANPTASSEPPTPNPEGAPAPSsnEQPPAAASTDEATQKALD 564
Cdd:PRK07764  384 RLGVAGGAGAPAAAAPSAAAAAPAAAPAPAAAA-----PAAAAAPAPAAAPQPAPAPAPA--PAPPSPAGNAPAGGAPSP 456
                          90
                  ....*....|
gi 30984464   565 ALRDRQPPEP 574
Cdd:PRK07764  457 PPAAAPSAQP 466
PRK14965 PRK14965
DNA polymerase III subunits gamma and tau; Provisional
477-571 7.49e-03

Pssm-ID: 237871 [Multi-domain]  Cd Length: 576  Bit Score: 42.04  E-value: 7.49e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   477 SSEPPTPAGRPPTPagRPPTPANPTASSEPPTPAGRPptPAGRPPTPAnptassePPTPNPEGAPAPSSNEQppAAASTD 556
Cdd:PRK14965  381 APAPPSAAWGAPTP--AAPAAPPPAAAPPVPPAAPAR--PAAARPAPA-------PAPPAAAAPPARSADPA--AAASAG 447
                          90
                  ....*....|....*
gi 30984464   557 EATQKALDALRDRQP 571
Cdd:PRK14965  448 DRWRAFVAFVKGKKP 462
DUF3729 pfam12526
Protein of unknown function (DUF3729); This family of proteins is found in viruses. Proteins ...
470-543 7.60e-03

Protein of unknown function (DUF3729); This family of proteins is found in viruses. Proteins in this family are typically between 145 and 1707 amino acids in length. The family is found in association with pfam01443, pfam01661, pfam05417, pfam01660, pfam00978. There is a single completely conserved residue L that may be functionally important.


Pssm-ID: 372164 [Multi-domain]  Cd Length: 115  Bit Score: 38.91  E-value: 7.60e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 30984464    470 PAGPTTASSEPPTPAGRPPTPAGRPPTPANPT----ASSEPPTPAGRPPTPAGRPPTPANPTASSEPPTPNPEGAPAP 543
Cdd:pfam12526   29 FSPPESAHPDPPPPVGDPRPPVVDTPPPVSAVwvlpPPSEPAAPEPDLVPPVTGPAGPPSPLAPPAPAQKPPLPPPRP 106
PHA03369 PHA03369
capsid maturational protease; Provisional
452-550 7.79e-03

Pssm-ID: 223061 [Multi-domain]  Cd Length: 663  Bit Score: 41.91  E-value: 7.79e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   452 SEPPTPAGPTTASSEPPTPAGPTtassePPTPAGRPPTPAGRPPTPANPTASSEPPTPAGRPPTPAGRPPTPANPTASSE 531
Cdd:PHA03369  357 SRVLAAAAKVAVIAAPQTHTGPA-----DRQRPQRPDGIPYSVPARSPMTAYPPVPQFCGDPGLVSPYNPQSPGTSYGPE 431
                          90
                  ....*....|....*....
gi 30984464   532 PPTPNPegaPAPSSNEQPP 550
Cdd:PHA03369  432 PVGPVP---PQPTNPYVMP 447
PRK08691 PRK08691
DNA polymerase III subunits gamma and tau; Validated
455-605 8.60e-03

Pssm-ID: 236333 [Multi-domain]  Cd Length: 709  Bit Score: 42.00  E-value: 8.60e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   455 PTPAGPTTASSEPPTPAGPTTASSEPPTPAGRPPTPAGRPPTPANPTASSEPPTPAGRPPTPAGRPPTPANPTASSEPPT 534
Cdd:PRK08691  451 QTSAKSIQTASEAETPPENQVSKNKAADNETDAPLSEVPSENPIQATPNDEAVETETFAHEAPAEPFYGYGFPDNDCPPE 530
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 30984464   535 PNPEGAPAPSSNEQPPAAA---STDEATQKALDalRDRQPPEPPCGSLTE---LLGRHPDTDGGVSRLAAHEAGIAR 605
Cdd:PRK08691  531 DGAEIPPPDWEHAAPADTAgggADEEAEAGGIG--GNNTPSAPPPEFSTEnwaAIVRHFARKLGAAQMPAQHSAWTE 605
SPT5 COG5164
Transcription elongation factor SPT5 [Transcription];
456-561 8.80e-03

Pssm-ID: 444063 [Multi-domain]  Cd Length: 495  Bit Score: 41.55  E-value: 8.80e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464  456 TPAGPTTASSePPTP-------------AGPTTASSEPP-----TPAGRPPTPAGRPPTPANPTASSEPPTPAGR----- 512
Cdd:COG5164  100 TPAGDGGATG-PPDDggatgppddggstTPPSGGSTTPPgdggsTPPGPGSTGPGGSTTPPGDGGSTTPPGPGGSttppd 178
                         90       100       110       120       130
                 ....*....|....*....|....*....|....*....|....*....|....*.
gi 30984464  513 ------PPTPAG-RPPTPANPTASSEPPTPNPEGAPAPSSNEQPPAAASTDEATQK 561
Cdd:COG5164  179 dggsttPPNKGEtGTDIPTGGTPRQGPDGPVKKDDKNGKGNPPDDRGGKTGPKDQR 234
COG3903 COG3903
Predicted ATPase [General function prediction only];
1365-1829 8.91e-03

Pssm-ID: 443109 [Multi-domain]  Cd Length: 933  Bit Score: 41.93  E-value: 8.91e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464 1365 RALQELGKVVAATRRRADELEAAAADLAEKLADRDARAGRERWAADVEAALDR-VENRAEFDAVELRRLQALAAQNKYNP 1443
Cdd:COG3903  482 AEAGERAAARRRHADYYLALAERAAAELRGPDQLAWLARLDAEHDNLRAALRWaLAHGDAELALRLAAALAPFWFLRGLL 561
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464 1444 RDFRKRAEQALAANAKTATLALEAAFAfnpytpenqhhpalppLAAARRIDWGPAFGAAAETYAEMFRVDTEPLARLLRI 1523
Cdd:COG3903  562 REGRRWLERALAAAGEAAAALAAAAAL----------------AAAAAAARAAAAAAAAAAAAAAAAAAAAAAAAAALLL 625
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464 1524 TGGLLDLAQAGGGFIDYHEAVSRLAEDLNGVPSLRHYVPFFRRGHAEYLELCDRLDALRADVHRALGGVPLDLAAAAEQT 1603
Cdd:COG3903  626 LAALAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAALAAAAAAAAAAAAAAAAAAAALAAAAAALAAAAAAAALAA 705
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464 1604 VRLRGDPAAAAELVRTGVTLACPSEDALAACVGALERVDQAPVKDTAYAEHVAFVARRDLGEAKDALVRAKQQRAEATDR 1683
Cdd:COG3903  706 AAAAALAAAAAAAAAAAAAAALLAAAAAAALAAAAAAAALALAAAAAAAAAAAAAAALAAAAAAAALAALLLALAAAAAA 785
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464 1684 VTAALREALAAHERQARSEAESLANLKTLLRVAAIPAAAAKTLEQARSVAEIVDQIELLLEQTEKAAELDQAAVDWLEHA 1763
Cdd:COG3903  786 LAAAAAAAAAAAAAAAAAAAAAAAAAAAAALAAAAAAAAAAAAALAAALAAAAAAAAAAAAAAAAAAALAAALAAAAAAA 865
                        410       420       430       440       450       460
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 30984464 1764 RRVFEAHPLTAARDGSPDPLARLHARLDALGETRRRTAALRRSLEAAEAEWDEVWARFGRARGGAW 1829
Cdd:COG3903  866 AAAALAAAAAAAAAAAAALLAAAAAAAAAAAAAAAAAAALAAAAAAAAAAALAAAAAAAAAAAAAA 931
KAR9 pfam08580
Yeast cortical protein KAR9; The KAR9 protein in Saccharomyces cerevisiae is a cytoskeletal ...
452-572 9.55e-03

Yeast cortical protein KAR9; The KAR9 protein in Saccharomyces cerevisiae is a cytoskeletal protein required for karyogamy, correct positioning of the mitotic spindle and for orientation of cytoplasmic microtubules. KAR9 localizes at the shmoo tip in mating cells and at the tip of the growing bud in anaphase.


Pssm-ID: 430088 [Multi-domain]  Cd Length: 684  Bit Score: 41.74  E-value: 9.55e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464    452 SEPPTPAG-PTTASSEPPTPAGPTTASS------------EPPTPAGRPPTP-AGRPPTPANPTAS------SEPPTPAG 511
Cdd:pfam08580  514 SETPTPALrPPSRPQPPPPGNRPRWNAStntndldvghnfKPLTLTTPSPTPsRSSRSSSTLPPVSplsrdkSRSPAPTC 593
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 30984464    512 RPPTPAGRPPTPANPTAS----------SEPPTPN---PEGAPAPSSNEQPPAAASTDEATQKALDALRDRQPP 572
Cdd:pfam08580  594 RSVSRASRRRASRKPTRIgspnsrtsllDEPPYPKltlSKGLPRTPRNRQSYAGTSPSRSVSVSSGLGPQTRPG 667
PRK13855 PRK13855
type IV secretion system protein VirB10; Provisional
466-577 9.66e-03

Pssm-ID: 172376  Cd Length: 376  Bit Score: 41.43  E-value: 9.66e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   466 EPPTPAGPTTASSEPPTPAgrpptPAGRPPTPAnptassepptpagrPPTPAGRPPTPANPTASSEPPTPNPEGAP--AP 543
Cdd:PRK13855   58 EPAPPSTMIATNTKPFHPA-----PIDVPPDPP--------------AAQEAVQPTAPPSAQSEPERNEPRPEETPifAY 118
                          90       100       110
                  ....*....|....*....|....*....|....
gi 30984464   544 SSNEQPPAAASTDEATQKALDAlrDRQPPEPPCG 577
Cdd:PRK13855  119 SSGDQGGSKRAGHGDTDRRQDD--NREDNSLPAG 150
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
404-539 9.76e-03

Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 41.62  E-value: 9.76e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464   404 PAAGPRPPTPASRPPTPGAPPTPGAPPTPGAPPTPGAPPTPAGPTTASSEPPTPAGPTTASSEPPTPAGPTTASSEPPTP 483
Cdd:PRK14951  367 AAAAEAAAPAEKKTPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAPAAVA 446
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 30984464   484 AGRPPTPAGRPPTPANPTASSEPPTPAGRPPTPAGRPPtpanptasSEPPTPNPEG 539
Cdd:PRK14951  447 LAPAPPAQAAPETVAIPVRVAPEPAVASAAPAPAAAPA--------AARLTPTEEG 494
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
1408-1821 9.90e-03

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 41.85  E-value: 9.90e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464 1408 AADVEAALDRVENRAEFDAVELRRLQALAAQNKYNPRDFRKRAEQALAANAKTATLALEAAFAFNPYTPENQHHPALPPL 1487
Cdd:COG1196  442 EALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLEELAEAAARLLLLLEAEADYEGFLEGVKAALLLAGLRG 521
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464 1488 AA---ARRIDWGPAFGAAAETYAEmfrvdteplARLLRITGGLLDLAQagggfidyhEAVSRLAEDLNGvpslrhyvpff 1564
Cdd:COG1196  522 LAgavAVLIGVEAAYEAALEAALA---------AALQNIVVEDDEVAA---------AAIEYLKAAKAG----------- 572
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464 1565 rRGHAEYLELCDRLDALRADVHRALGGVPLDLAAAAEQTVRLRGDPAAAAELVRTGVTLACPSEDALAacvgalervdqa 1644
Cdd:COG1196  573 -RATFLPLDKIRARAALAAALARGAIGAAVDLVASDLREADARYYVLGDTLLGRTLVAARLEAALRRA------------ 639
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464 1645 pvKDTAYAEHVAFVARRDLGEAKDALVRAKQQRAEATDRVTAALREALAAHERQARSEAESLANLKTLLRvaaipaaaak 1724
Cdd:COG1196  640 --VTLAGRLREVTLEGEGGSAGGSLTGGSRRELLAALLEAEAELEELAERLAEEELELEEALLAEEEEER---------- 707
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 30984464 1725 tLEQARSVAEIVDQIELLLEQTEKAAELDQAAVDWLEHARRVFEAhplTAARDGSPDPLARLHARLDALgetrrrtaalR 1804
Cdd:COG1196  708 -ELAEAEEERLEEELEEEALEEQLEAEREELLEELLEEEELLEEE---ALEELPEPPDLEELERELERL----------E 773
                        410       420
                 ....*....|....*....|....
gi 30984464 1805 RSLE-------AAEAEWDEVWARF 1821
Cdd:COG1196  774 REIEalgpvnlLAIEEYEELEERY 797
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
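
The E-value threshold above can be checked against the interval and E-value summary line that precedes each hit in this listing (for example "495-582 5.97e-03"). The following is a minimal sketch, not part of the CD-Search output or any NCBI tool: the names HIT_LINE, parse_hit, and passes_threshold are hypothetical, the regular expression assumes the plain-text layout shown here, and treating the threshold as inclusive is an assumption.

```python
import re

# Hypothetical pattern for a summary line such as "495-582 5.97e-03"
# (query interval, then E-value), as laid out in this plain-text listing.
HIT_LINE = re.compile(r"^(\d+)-(\d+)\s+([0-9.]+e[+-]\d+)$")


def parse_hit(line):
    """Return (start, end, evalue) for a summary line, or None if it does not match."""
    m = HIT_LINE.match(line.strip())
    if m is None:
        return None
    return int(m.group(1)), int(m.group(2)), float(m.group(3))


def passes_threshold(evalue, threshold=0.01):
    """Assumption: a hit is kept when its E-value is at or below the threshold."""
    return evalue <= threshold


if __name__ == "__main__":
    for text in ["495-582 5.97e-03", "1408-1821 9.90e-03", "PHA03418 PHA03418"]:
        hit = parse_hit(text)
        if hit is None:
            print(f"{text!r}: not an interval/E-value line")
        else:
            start, end, evalue = hit
            print(f"{start}-{end}: E-value {evalue:g}, kept={passes_threshold(evalue)}")
```

Lines that do not follow the interval and E-value layout (names, accessions, alignment rows) return None, so the helper can be run over the whole listing without pre-filtering.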

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D):384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D):265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D):200-3.