Conserved domains on [gi|1907117193|ref|XP_036015745|]

eukaryotic translation initiation factor 4 gamma 1 isoform X11 [Mus musculus]

Protein Classification

MA3 and W2_eIF4G1_like domain-containing protein (domain architecture ID 13556876)

protein containing domains MIF4G, MA3, W2_eIF4G1_like, and W2

List of domain hits

Name Accession Description Interval E-value
MIF4G pfam02854
MIF4G domain; MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). ...
718-946 2.50e-63

MIF4G domain; MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA and DNA.


Pssm-ID: 397130  Cd Length: 203  Bit Score: 214.15  E-value: 2.50e-63
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907117193  718 FRRVRSILNKLTPQMFQQLMKQVTQLAIDTEERLKGVIDLIFEKAISEPNFSVAYANMCRCLMAlkvpttekpTVTVNFR 797
Cdd:pfam02854    1 LKKVKGILNKLSPENFEKLIKELLKLIMSDPELLKYLIELIFEKAVEEPNFIPAYARLCSGLNL---------RNPTDFG 71
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907117193  798 KLLLNRCQKEFEKdkdddevfekkqkemdeaataeergrlKEELEEARDIARRRSLGNIKFIGELFKLKMLTEAIMHDCV 877
Cdd:pfam02854   72 IHLLNRLQEEFEK---------------------------RFELEENEQGNRRRRLGLVRFLGELYKFGLLTEKILFECL 124
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1907117193  878 VKLLKNH-------DEESLECLCRLLTTIGKDLDFAKAKPRMDQYFNQMEKII---KEKKTSSRIRFMLQDVLDLRQSN 946
Cdd:pfam02854  125 KELLSSLtkedlkrDLFNLECLLTLLTTIGKLLENEKLPKLMDQFLDEIQKYVlskDDPKLSSRLRFMLQDLIELRKNK 203
W2_eIF4G1_like cd11559
C-terminal W2 domain of eukaryotic translation initiation factor 4 gamma 1 and similar ...
1394-1523 1.12e-54

C-terminal W2 domain of eukaryotic translation initiation factor 4 gamma 1 and similar proteins; eIF4G1 is a component of the multi-subunit eukaryotic translation initiation factor 4F, which facilitates recruitment of the mRNA to the ribosome, a rate-limiting step during translation initiation. This C-terminal domain, whose structure resembles that of a set of concatenated HEAT repeats, has been associated with binding to/recruiting the kinase Mnk1, which phosphorylates eIF4E.


Pssm-ID: 211397  Cd Length: 134  Bit Score: 186.72  E-value: 1.12e-54
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907117193 1394 EELRRQLEKLLKDGGSNQRVFDWIDANLNEQQIASNTLVRALMTTVCYSAIIFETPLRVDVQVLKVRARLLQKYLC-DEQ 1472
Cdd:cd11559      4 LRVQAELLKLLQEDPNPDELYKWIKENVSPELYASPGFVRALMTAVLKYAIEEKSLPEKEKALLEKYAPLLQKYLDdDEQ 83
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1907117193 1473 KELQALYALQALVVTLEQPANLLRMFFDALYDEDVVKEDAFYSWESSKDPA 1523
Cdd:cd11559     84 LQLQALYALQALVHTLEFPKGLLLRFFDALYDEDVIEEEAFLKWKEDVDPA 134
MA3 pfam02847
MA3 domain; Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain ...
1195-1307 4.88e-35

MA3 domain; Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains.


Pssm-ID: 397128  Cd Length: 113  Bit Score: 129.70  E-value: 4.88e-35
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907117193 1195 VEKKSKAIIEEYLHLNDMKEAVQCVQELASPSLLFIFVRLGIESTLERSTIAREHMGRLLHQLLCAGHLSTAQYYQGLYE 1274
Cdd:pfam02847    1 LKRKIFLILEEYLSSGDYDEAARCLLKLGLPSQHHEVVKVLIECALEESKTYREFYGLLLERLCEFNLISTKQFEKGFWR 80
                           90       100       110
                   ....*....|....*....|....*....|...
gi 1907117193 1275 TLELAEDMEIDIPHVWLYLAELITPILQEDGVP 1307
Cdd:pfam02847   81 VLEDLEDLELDIPNAWRNLAEFVARLISDDGLP 113
Herpes_BLLF1 super family cl37540
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
156-415 1.09e-04

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


The actual alignment was detected with superfamily member pfam05109:

Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 46.83  E-value: 1.09e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907117193  156 IMSGARTASTPTPPQTGGSLEPQPNGESPQVaviirpdDRSQGAAIGGRPGLPGPEHSPGTESQPSSPSPTPSPPPILEP 235
Cdd:pfam05109  404 IITRTATNATTTTHKVIFSKAPESTTTSPTL-------NTTGFAAPNTTTGLPSSTHVPTNLTAPASTGPTVSTADVTSP 476
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907117193  236 ---GSESNLGVLS---IPGDTMTTGMIPMSVEESTPISCETgePYCLSPEPTLAEPILE-VEVTLSKPIPESEFsSSPLQ 308
Cdd:pfam05109  477 tpaGTTSGASPVTpspSPRDNGTESKAPDMTSPTSAVTTPT--PNATSPTPAVTTPTPNaTSPTLGKTSPTSAV-TTPTP 553
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907117193  309 VSTALVPhKVETHEPNGVIPSEDLEPEVESSTEPAPPPLSPCASESlVPIAPTAQpeELLNGAPSPPAVDLSP---VSEP 385
Cdd:pfam05109  554 NATSPTP-AVTTPTPNATIPTLGKTSPTSAVTTPTPNATSPTVGET-SPQANTTN--HTLGGTSSTPVVTSPPknaTSAV 629
                          250       260       270
                   ....*....|....*....|....*....|
gi 1907117193  386 EEQAKKVSSAALASiLSPAPPVAPSDTSPA 415
Cdd:pfam05109  630 TTGQHNITSSSTSS-MSLRPSSISETLSPS 658
PHA03247 super family cl33720
large tegument protein UL36; Provisional
2-523 2.14e-04

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 46.47  E-value: 2.14e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907117193    2 NTPSQPRQHFYPSRAQPPSSAASrvqSAAPARPGPAPHVYPAGSQvmmipsqisysasqgayyiPGQGRSTYVVPTQQYP 81
Cdd:PHA03247  2596 ARPRAPVDDRGDPRGPAPPSPLP---PDTHAPDPPPPSPSPAANE-------------------PDPHPPPTVPPPERPR 2653
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907117193   82 VQPGAPGFYPGASPTEFGTYAGAYYPAQGVQQ--FPASVAPAPVLMNQPPQIAPKRERktirirdPNQGGKDITEEIMSG 159
Cdd:PHA03247  2654 DDPAPGRVSRPRRARRLGRAAQASSPPQRPRRraARPTVGSLTSLADPPPPPPTPEPA-------PHALVSATPLPPGPA 2726
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907117193  160 ARTASTPTPPQTGGslePQPNGESPQVAVIIRPDDRSQGAAIGGRPGLP-GPEHSP--GTESQPSSPSPTPSPPPILEPG 236
Cdd:PHA03247  2727 AARQASPALPAAPA---PPAVPAGPATPGGPARPARPPTTAGPPAPAPPaAPAAGPprRLTRPAVASLSESRESLPSPWD 2803
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907117193  237 SESNLGVLSIPGDTMTTGMIPMSVEESTPISCETgepyclsPEPTLAEPILEVEVTLSKPIPESEFSSSPLQVSTALVPh 316
Cdd:PHA03247  2804 PADPPAAVLAPAAALPPAASPAGPLPPPTSAQPT-------APPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKP- 2875
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907117193  317 KVETHEPNGVIPSEDLEPEVESSTEPAPPPLSPCASESLVPIAPTAQPEELLNGAPSP-----------PAVDLSPVSEP 385
Cdd:PHA03247  2876 AAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPpppprpqpplaPTTDPAGAGEP 2955
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907117193  386 EEQAKKVSSAALASILSPA-----PPVAPSDTSPAQEEEMEEDDDDEEGGEAESEKGgedVPLDSTPVPAQLSQNLEVAA 460
Cdd:PHA03247  2956 SGAVPQPWLGALVPGRVAVprfrvPQPAPSREAPASSTPPLTGHSLSRVSSWASSLA---LHEETDPPPVSLKQTLWPPD 3032
                          490       500       510       520       530       540
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907117193  461 ATQVAvSVPKRRRKIKELNKKEAVgDLLDAFKEVDPAVPEVENQPPTGSNPSPESEGSMVPTQ 523
Cdd:PHA03247  3033 DTEDS-DADSLFDSDSERSDLEAL-DPLPPEPHDPFAHEPDPATPEAGARESPSSQFGPPPLS 3093
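
Each hit above carries a statistics line of the form `Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...`. The following is a minimal Python sketch of how such lines could be read into structured values, assuming the text layout shown on this page. The function name `parse_hit_stats` and the dictionary keys are illustrative and are not part of any NCBI tool.

```python
import re

# Pattern for the per-hit statistics line as it appears in this page's text.
# The optional bracketed tag (e.g. "[Multi-domain]") is captured separately.
STATS_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<tag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_hit_stats(line):
    """Return the statistics of one hit as a dict, or None if the line does not match."""
    match = STATS_RE.search(line)
    if match is None:
        return None
    return {
        "pssm_id": int(match.group("pssm_id")),
        "multi_domain": match.group("tag") == "Multi-domain",
        "cd_length": int(match.group("cd_length")),
        "bit_score": float(match.group("bit_score")),
        "e_value": float(match.group("e_value")),
    }


if __name__ == "__main__":
    # Statistics line of the MIF4G (pfam02854) hit, copied from the list above.
    print(parse_hit_stats("Pssm-ID: 397130  Cd Length: 203  Bit Score: 214.15  E-value: 2.50e-63"))
```

Returning `None` for non-matching lines lets a caller scan the whole page text line by line and keep only the statistics lines.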
 
MIF4G smart00543
Middle domain of eukaryotic initiation factor 4G (eIF4G); Also occurs in NMD2p and CBP80. The ...
719-946 7.36e-53

Middle domain of eukaryotic initiation factor 4G (eIF4G); Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA and DNA. Ponting (TiBS) "Novel eIF4G domain homologues" (in press).


Pssm-ID: 214713  Cd Length: 200  Bit Score: 184.10  E-value: 7.36e-53
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907117193   719 RRVRSILNKLTPQMFQQLMKQVTQLAIDTEERLKGVIDLIFEKAISEPNFSVAYANMCRCLMAlKVPttekptvtvNFRK 798
Cdd:smart00543    2 KKVKGLINKLSPSNFESIIKELLKLNNSDKNLRKYILELIFEKAVEEPNFIPAYARLCALLNA-KNP---------DFGS 71
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907117193   799 LLLNRCQKEFEKDkdddevfekkqkemdeaataeergrlkeeLEEARDIARRRSLGNIKFIGELFKLKMLTEAIMHDCVV 878
Cdd:smart00543   72 LLLERLQEEFEKG-----------------------------LESEEESDKQRRLGLVRFLGELYNFQVLTSKIILELLK 122
                           170       180       190       200       210       220       230
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907117193   879 KLLKNH-------DEESLECLCRLLTTIGKDLDFAKAKPRMDQYFNQMEKIIKEKKT---SSRIRFMLQDVLDLRQSN 946
Cdd:smart00543  123 ELLNDLtkldpprSDFSVECLLSLLPTCGKDLEREKSPKLLDEILERLQDYLLKKDKtelSSRLRFMLELLIELRKNK 200
MA3 smart00544
Domain in DAP-5, eIF4G, MA-3 and other proteins; Highly alpha-helical. May contain repeats and ...
1195-1307 2.62e-34

Domain in DAP-5, eIF4G, MA-3 and other proteins; Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains. Ponting (TiBS) "Novel eIF4G domain homologues" (in press).


Pssm-ID: 214714  Cd Length: 113  Bit Score: 127.75  E-value: 2.62e-34
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907117193  1195 VEKKSKAIIEEYLHLNDMKEAVQCVQELASPSLLFIFVRLGIESTLERSTIAREHMGRLLHQLLCAGHLSTAQYYQGLYE 1274
Cdd:smart00544    1 LKKKIFLIIEEYLSSGDTDEAVHCLLELKLPEQHHEVVKVLLTCALEEKRTYREMYSVLLSRLCQANVISTKQFEKGFWR 80
                            90       100       110
                    ....*....|....*....|....*....|...
gi 1907117193  1275 TLELAEDMEIDIPHVWLYLAELITPILQEDGVP 1307
Cdd:smart00544   81 LLEDIEDLELDIPNAWRNLAEFVARLISDGILP 113
eIF5C smart00515
Domain at the C-termini of GCD6, eIF-2B epsilon, eIF-4 gamma and eIF-5;
1463-1545 2.00e-27

Domain at the C-termini of GCD6, eIF-2B epsilon, eIF-4 gamma and eIF-5;


Pssm-ID: 214705  Cd Length: 83  Bit Score: 106.99  E-value: 2.00e-27
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907117193  1463 LLQKYLCDEQKELQALYALQALVVTLEQPANLLRMFFDALYDEDVVKEDAFYSWESSKDPAEqqGKGVALKSVTAFFNWL 1542
Cdd:smart00515    3 LLKFLAKDEEEQLELLYAIEEFCVELEKLGKLLPKILKSLYDADILEEEAILKWYEKAVSAE--GKKKVRKNAKPFVTWL 80

                    ...
gi 1907117193  1543 REA 1545
Cdd:smart00515   81 QEA 83
W2 pfam02020
eIF4-gamma/eIF5/eIF2-epsilon; This domain of unknown function is found at the C-terminus of ...
1474-1550 6.98e-23

eIF4-gamma/eIF5/eIF2-epsilon; This domain of unknown function is found at the C-terminus of several translation initiation factors.


Pssm-ID: 460415  Cd Length: 76  Bit Score: 93.75  E-value: 6.98e-23
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1907117193 1474 ELQALYALQALVVTLEQPANLLRMFFDALYDEDVVKEDAFYSWESSKDPAEqQGKGVALKSVTAFFNWLREAEDEES 1550
Cdd:pfam02020    1 QVDLLLALQEFCAKLEELLKLLLKILKALYDLDIVEEEAILKWWEDVSSAE-KGMKKVRKQAKPFVEWLEEAEEESD 76
PHA03247 PHA03247
large tegument protein UL36; Provisional
115-528 3.40e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 45.70  E-value: 3.40e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907117193  115 PASVAPAPVLMNQPPQIAPK--------RERKTIRIRDPNQGGKDITEEIMSGARTASTPTPPQTGgSLEPQPNGESPqv 186
Cdd:PHA03247  2557 PAAPPAAPDRSVPPPRPAPRpsepavtsRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTH-APDPPPPSPSP-- 2633
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907117193  187 aviirpddRSQGAAIGGRPGLPGPEHSPGTESQPSSPSPTPSPPPILEPGSESNLGVLSIPGDTMTTGMIPMSVEESTPi 266
Cdd:PHA03247  2634 --------AANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPP- 2704
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907117193  267 scetgEPyclSPEPTlaePILEVEVTLSKPIPESEFSSSPLQVSTALVPHKVETHEPNGVIPSEDLEPEVESSTEPAPPP 346
Cdd:PHA03247  2705 -----PP---TPEPA---PHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPA 2773
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907117193  347 LSPCASESLVPIAPTAQPEELLNGAPSPPAVDLSPVSEPEeqakkvSSAALASILSPAPPVAPSDTS-PAQEEEMEEDDD 425
Cdd:PHA03247  2774 APAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLA------PAAALPPAASPAGPLPPPTSAqPTAPPPPPGPPP 2847
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907117193  426 DEEGGEAESEKGGedvPLDSTPVPAQlsqnlevAAATQVAVSVPKRRRKikelnKKEAVGDLLDAFKE-VDPAVPEVENQ 504
Cdd:PHA03247  2848 PSLPLGGSVAPGG---DVRRRPPSRS-------PAAKPAAPARPPVRRL-----ARPAVSRSTESFALpPDQPERPPQPQ 2912
                          410       420
                   ....*....|....*....|....
gi 1907117193  505 PPTGSNPSPESEGSMVPTQPEETE 528
Cdd:PHA03247  2913 APPPPQPQPQPPPPPQPQPPPPPP 2936
rad2 TIGR00600
DNA excision repair protein (rad2); All proteins in this family for which functions are known ...
326-565 1.49e-03

DNA excision repair protein (rad2); All proteins in this family for which functions are known are flap endonucleases that generate the 3' incision next to DNA damage as part of nucleotide excision repair. This family is related to many other flap endonuclease families including the fen1 family. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 273166 [Multi-domain]  Cd Length: 1034  Bit Score: 43.35  E-value: 1.49e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907117193  326 VIPSEDlEPEVESSTEPAPPPLSpCASESLVPIAPTAQPEELLNG-APSPPAVDLSPVSepeeqakkvSSAALASILSPA 404
Cdd:TIGR00600  520 VKPVSS-EFGLPSQREDKLAIPT-EGTQNLQGISDHPEQFEFQNElSPLETKNNESNLS---------SDAETEGSPNPE 588
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907117193  405 PPVAPSDTSPAQEEEMEEDDDDEeggeaesekGGEDV--PLDSTPVPAQLSQnlevaAATQVAVSVPKRRRKIkELNKKE 482
Cdd:TIGR00600  589 MPSWSSVTVPSEALDNYETTNPS---------NAKEVrnFAETGIQTTNVGE-----SADLLLISNPMEVEPM-ESEKEE 653
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907117193  483 AVGDllDAFKEVDPAVPEVENQPPTGSNPSPESE-------GSMVPTQPEETEEtWDSKEDKIHNAENIQPGEQKYEYKS 555
Cdd:TIGR00600  654 SESD--GSFIEVDSVSSTLELQVPSKSQPTDESEenaenkvASIEGEHRKEIED-LLFDESEEDNIVGMIEEEKDADDFK 730
                          250
                   ....*....|
gi 1907117193  556 DQWKPLNLEE 565
Cdd:TIGR00600  731 NEWQDISLEE 740
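
The pairwise alignments in this list can be summarised by counting identical residues over aligned columns. The Python sketch below assumes that `-` marks a gap and that lowercase residues are unaligned, which is how the rows on this page appear to be laid out. The function name `alignment_identity` is illustrative and is not an NCBI API.

```python
def alignment_identity(query_row, model_row):
    """Count (identical, aligned) columns for two equal-length alignment rows.

    A column counts as aligned only when both rows hold an uppercase residue;
    gap characters and lowercase (assumed unaligned) residues are skipped.
    """
    if len(query_row) != len(model_row):
        raise ValueError("alignment rows must have equal length")
    identical = 0
    aligned = 0
    for q, m in zip(query_row, model_row):
        if not (q.isupper() and m.isupper()):
            continue
        aligned += 1
        if q == m:
            identical += 1
    return identical, aligned


if __name__ == "__main__":
    # Final block of the rad2 (TIGR00600) alignment, copied from the hit above.
    identical, aligned = alignment_identity("DQWKPLNLEE", "NEWQDISLEE")
    print(f"{identical} identical residues in {aligned} aligned columns")
```

To score a whole hit, the residue strings of each block would first be concatenated per row, with the sequence labels and coordinates stripped.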
 
MA3 smart00544
Domain in DAP-5, eIF4G, MA-3 and other proteins; Highly alpha-helical. May contain repeats and ...
1195-1307 2.62e-34

Domain in DAP-5, eIF4G, MA-3 and other proteins; Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains. Ponting (TiBS) "Novel eIF4G domain homologues" (in press)


Pssm-ID: 214714  Cd Length: 113  Bit Score: 127.75  E-value: 2.62e-34
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907117193  1195 VEKKSKAIIEEYLHLNDMKEAVQCVQELASPSLLFIFVRLGIESTLERSTIAREHMGRLLHQLLCAGHLSTAQYYQGLYE 1274
Cdd:smart00544    1 LKKKIFLIIEEYLSSGDTDEAVHCLLELKLPEQHHEVVKVLLTCALEEKRTYREMYSVLLSRLCQANVISTKQFEKGFWR 80
                            90       100       110
                    ....*....|....*....|....*....|...
gi 1907117193  1275 TLELAEDMEIDIPHVWLYLAELITPILQEDGVP 1307
Cdd:smart00544   81 LLEDIEDLELDIPNAWRNLAEFVARLISDGILP 113
eIF5C smart00515
Domain at the C-termini of GCD6, eIF-2B epsilon, eIF-4 gamma and eIF-5;
1463-1545 2.00e-27

Domain at the C-termini of GCD6, eIF-2B epsilon, eIF-4 gamma and eIF-5;


Pssm-ID: 214705  Cd Length: 83  Bit Score: 106.99  E-value: 2.00e-27
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907117193  1463 LLQKYLCDEQKELQALYALQALVVTLEQPANLLRMFFDALYDEDVVKEDAFYSWESSKDPAEqqGKGVALKSVTAFFNWL 1542
Cdd:smart00515    3 LLKFLAKDEEEQLELLYAIEEFCVELEKLGKLLPKILKSLYDADILEEEAILKWYEKAVSAE--GKKKVRKNAKPFVTWL 80

                    ...
gi 1907117193  1543 REA 1545
Cdd:smart00515   81 QEA 83
W2 pfam02020
eIF4-gamma/eIF5/eIF2-epsilon; This domain of unknown function is found at the C-terminus of ...
1474-1550 6.98e-23

eIF4-gamma/eIF5/eIF2-epsilon; This domain of unknown function is found at the C-terminus of several translation initiation factors.


Pssm-ID: 460415  Cd Length: 76  Bit Score: 93.75  E-value: 6.98e-23
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1907117193 1474 ELQALYALQALVVTLEQPANLLRMFFDALYDEDVVKEDAFYSWESSKDPAEqQGKGVALKSVTAFFNWLREAEDEES 1550
Cdd:pfam02020    1 QVDLLLALQEFCAKLEELLKLLLKILKALYDLDIVEEEAILKWWEDVSSAE-KGMKKVRKQAKPFVEWLEEAEEESD 76
W2 cd11473
C-terminal domain of eIF4-gamma/eIF5/eIF2b-epsilon; This domain is found at the C-terminus of ...
1394-1517 2.35e-19

C-terminal domain of eIF4-gamma/eIF5/eIF2b-epsilon; This domain is found at the C-terminus of several translation initiation factors, including the epsilon chain of eIF2b, where it has been found to catalyze the conversion of eIF2.GDP to its active eIF2.GTP form. The structure of the domain resembles that of a set of concatenated HEAT repeats.


Pssm-ID: 211395  Cd Length: 135  Bit Score: 85.61  E-value: 2.35e-19
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907117193 1394 EELRRQLEKLLK-DGGSNQRVFDWIDANLNEQQIASNTLVRALMTTVCYSAIIFE----TPLRVDVQVLKVRARLLQKYL 1468
Cdd:cd11473      4 KKLRDSLLKELEeDKSSDVESVKAAKSKLDLDPISLEEVVKVLLTAVVNAVESADsislTQKEQLVLVLKKYGPVLRELL 83
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1907117193 1469 CD-EQKELQALYALQALVVT--LEQPANLLRMFFDALYDEDVVKEDAFYSWE 1517
Cdd:cd11473     84 KLiKKDQLYLLLKIEKLCLQlkLSELISLLEKILDLLYDADVLSEEAILSWF 135
W2_eIF2B_epsilon cd11558
C-terminal W2 domain of eukaryotic translation initiation factor 2B epsilon; eIF2B is a ...
1463-1550 1.77e-13

C-terminal W2 domain of eukaryotic translation initiation factor 2B epsilon; eIF2B is a heteropentameric complex which functions as a guanine nucleotide exchange factor in the recycling of eIF-2 during the initiation of translation in eukaryotes. The epsilon and gamma subunits are sequence similar and both are essential in yeast. Epsilon appears to be the catalytically active subunit, with gamma enhancing its activity. The C-terminal domain of the eIF2B epsilon subunit contains bipartite motifs rich in acidic and aromatic residues, which are responsible for the interaction with eIF2. The structure of the domain resembles that of a set of concatenated HEAT repeats.


Pssm-ID: 211396  Cd Length: 169  Bit Score: 69.98  E-value: 1.77e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907117193 1463 LLQKYLCDEQKELQALYALQALVVTLEQPANLLRMFFDALYDEDVVKEDAFYSWESSKDPAEQQGKGVALKSVTAFFNWL 1542
Cdd:cd11558     82 LLENYVKSQDDQVELLLALEEFCLESEEGGPLFAKLLHALYDLDILEEEAILEWWEEPDAGADEEMKKVRELVKKFIEWL 161

                   ....*...
gi 1907117193 1543 REAEDEES 1550
Cdd:cd11558    162 EEAEEESD 169
W2_eIF5 cd11561
C-terminal W2 domain of eukaryotic translation initiation factor 5; eIF5 functions as a GTPase ...
1454-1550 4.47e-07

C-terminal W2 domain of eukaryotic translation initiation factor 5; eIF5 functions as a GTPase acceleration protein (GAP), as well as a GDP dissociation inhibitor (GDI) during translational initiation in eukaryotes. The structure of this C-terminal domain resembles that of a set of concatenated HEAT repeats.


Pssm-ID: 211399  Cd Length: 157  Bit Score: 51.08  E-value: 4.47e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907117193 1454 VQVLKVRARLLQKYLCDEQKELQALYALQALVVtlEQPANLLRMF---FDALYDEDVVKEDAFYSW---ESSKDPAEQQG 1527
Cdd:cd11561     58 VKEIKKRKALLLKLVTDEKAQKALLGGIERFCG--KHSPELLKKVpliLKALYDNDILEEEVILKWyekVSKKYVSKEKS 135
                           90       100
                   ....*....|....*....|...
gi 1907117193 1528 KGVaLKSVTAFFNWLREAEDEES 1550
Cdd:cd11561    136 KKV-RKAAEPFVEWLEEAEEEEE 157
W2_eIF5C_like cd11560
C-terminal W2 domain of the eukaryotic translation initiation factor 5C and similar proteins; ...
1388-1548 2.81e-06

C-terminal W2 domain of the eukaryotic translation initiation factor 5C and similar proteins; eIF5C appears to be essential for the initiation of protein translation; its actual function, and specifically that of the C-terminal W2 domain, are not well understood. The Drosophila ortholog, kra (krasavietz) or exba (extra bases), may be involved in translational inhibition in neural development. The structure of this C-terminal domain resembles that of a set of concatenated HEAT repeats.


Pssm-ID: 211398 [Multi-domain]  Cd Length: 194  Bit Score: 49.52  E-value: 2.81e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907117193 1388 QRTLAFEELRRQLEKLLKDGGSNQRVFDWIDANLNEQQIASN--------TLVRALMTTVCYSA---IIFETPLRVdvqv 1456
Cdd:cd11560     29 YRKQASQEIKKELQQELKEMIAEEEPVKEIIAAVKEQMKKSSlpehevvgLLWTALMDAVEWSKkedQIAEQALRH---- 104
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907117193 1457 LKVRARLLQKYLCDEQKELQALYALQalVVTLEQpANLLRMFFD---ALYDEDVVKEDAFYSWesSKDPAEQQGKGVALK 1533
Cdd:cd11560    105 LKKYAPLLAAFCTTARAELALLNKIQ--EYCYEN-MKFMKVFQKivkLLYKADVLSEDAILKW--YKKGHSPKGKQVFLK 179
                          170
                   ....*....|....*
gi 1907117193 1534 SVTAFFNWLREAEDE 1548
Cdd:cd11560    180 QMEPFVEWLQEAEEE 194
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
156-415 1.09e-04

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 46.83  E-value: 1.09e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907117193  156 IMSGARTASTPTPPQTGGSLEPQPNGESPQVaviirpdDRSQGAAIGGRPGLPGPEHSPGTESQPSSPSPTPSPPPILEP 235
Cdd:pfam05109  404 IITRTATNATTTTHKVIFSKAPESTTTSPTL-------NTTGFAAPNTTTGLPSSTHVPTNLTAPASTGPTVSTADVTSP 476
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907117193  236 ---GSESNLGVLS---IPGDTMTTGMIPMSVEESTPISCETgePYCLSPEPTLAEPILE-VEVTLSKPIPESEFsSSPLQ 308
Cdd:pfam05109  477 tpaGTTSGASPVTpspSPRDNGTESKAPDMTSPTSAVTTPT--PNATSPTPAVTTPTPNaTSPTLGKTSPTSAV-TTPTP 553
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907117193  309 VSTALVPhKVETHEPNGVIPSEDLEPEVESSTEPAPPPLSPCASESlVPIAPTAQpeELLNGAPSPPAVDLSP---VSEP 385
Cdd:pfam05109  554 NATSPTP-AVTTPTPNATIPTLGKTSPTSAVTTPTPNATSPTVGET-SPQANTTN--HTLGGTSSTPVVTSPPknaTSAV 629
                          250       260       270
                   ....*....|....*....|....*....|
gi 1907117193  386 EEQAKKVSSAALASiLSPAPPVAPSDTSPA 415
Cdd:pfam05109  630 TTGQHNITSSSTSS-MSLRPSSISETLSPS 658
PHA03247 PHA03247
large tegument protein UL36; Provisional
2-523 2.14e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 46.47  E-value: 2.14e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907117193    2 NTPSQPRQHFYPSRAQPPSSAASrvqSAAPARPGPAPHVYPAGSQvmmipsqisysasqgayyiPGQGRSTYVVPTQQYP 81
Cdd:PHA03247  2596 ARPRAPVDDRGDPRGPAPPSPLP---PDTHAPDPPPPSPSPAANE-------------------PDPHPPPTVPPPERPR 2653
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907117193   82 VQPGAPGFYPGASPTEFGTYAGAYYPAQGVQQ--FPASVAPAPVLMNQPPQIAPKRERktirirdPNQGGKDITEEIMSG 159
Cdd:PHA03247  2654 DDPAPGRVSRPRRARRLGRAAQASSPPQRPRRraARPTVGSLTSLADPPPPPPTPEPA-------PHALVSATPLPPGPA 2726
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907117193  160 ARTASTPTPPQTGGslePQPNGESPQVAVIIRPDDRSQGAAIGGRPGLP-GPEHSP--GTESQPSSPSPTPSPPPILEPG 236
Cdd:PHA03247  2727 AARQASPALPAAPA---PPAVPAGPATPGGPARPARPPTTAGPPAPAPPaAPAAGPprRLTRPAVASLSESRESLPSPWD 2803
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907117193  237 SESNLGVLSIPGDTMTTGMIPMSVEESTPISCETgepyclsPEPTLAEPILEVEVTLSKPIPESEFSSSPLQVSTALVPh 316
Cdd:PHA03247  2804 PADPPAAVLAPAAALPPAASPAGPLPPPTSAQPT-------APPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKP- 2875
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907117193  317 KVETHEPNGVIPSEDLEPEVESSTEPAPPPLSPCASESLVPIAPTAQPEELLNGAPSP-----------PAVDLSPVSEP 385
Cdd:PHA03247  2876 AAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPpppprpqpplaPTTDPAGAGEP 2955
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907117193  386 EEQAKKVSSAALASILSPA-----PPVAPSDTSPAQEEEMEEDDDDEEGGEAESEKGgedVPLDSTPVPAQLSQNLEVAA 460
Cdd:PHA03247  2956 SGAVPQPWLGALVPGRVAVprfrvPQPAPSREAPASSTPPLTGHSLSRVSSWASSLA---LHEETDPPPVSLKQTLWPPD 3032
                          490       500       510       520       530       540
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907117193  461 ATQVAvSVPKRRRKIKELNKKEAVgDLLDAFKEVDPAVPEVENQPPTGSNPSPESEGSMVPTQ 523
Cdd:PHA03247  3033 DTEDS-DADSLFDSDSERSDLEAL-DPLPPEPHDPFAHEPDPATPEAGARESPSSQFGPPPLS 3093
PHA03247 PHA03247
large tegument protein UL36; Provisional
115-528 3.40e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 45.70  E-value: 3.40e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907117193  115 PASVAPAPVLMNQPPQIAPK--------RERKTIRIRDPNQGGKDITEEIMSGARTASTPTPPQTGgSLEPQPNGESPqv 186
Cdd:PHA03247  2557 PAAPPAAPDRSVPPPRPAPRpsepavtsRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTH-APDPPPPSPSP-- 2633
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907117193  187 aviirpddRSQGAAIGGRPGLPGPEHSPGTESQPSSPSPTPSPPPILEPGSESNLGVLSIPGDTMTTGMIPMSVEESTPi 266
Cdd:PHA03247  2634 --------AANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPP- 2704
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907117193  267 scetgEPyclSPEPTlaePILEVEVTLSKPIPESEFSSSPLQVSTALVPHKVETHEPNGVIPSEDLEPEVESSTEPAPPP 346
Cdd:PHA03247  2705 -----PP---TPEPA---PHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPA 2773
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907117193  347 LSPCASESLVPIAPTAQPEELLNGAPSPPAVDLSPVSEPEeqakkvSSAALASILSPAPPVAPSDTS-PAQEEEMEEDDD 425
Cdd:PHA03247  2774 APAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLA------PAAALPPAASPAGPLPPPTSAqPTAPPPPPGPPP 2847
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907117193  426 DEEGGEAESEKGGedvPLDSTPVPAQlsqnlevAAATQVAVSVPKRRRKikelnKKEAVGDLLDAFKE-VDPAVPEVENQ 504
Cdd:PHA03247  2848 PSLPLGGSVAPGG---DVRRRPPSRS-------PAAKPAAPARPPVRRL-----ARPAVSRSTESFALpPDQPERPPQPQ 2912
                          410       420
                   ....*....|....*....|....
gi 1907117193  505 PPTGSNPSPESEGSMVPTQPEETE 528
Cdd:PHA03247  2913 APPPPQPQPQPPPPPQPQPPPPPP 2936
rad2 TIGR00600
DNA excision repair protein (rad2); All proteins in this family for which functions are known ...
326-565 1.49e-03

DNA excision repair protein (rad2); All proteins in this family for which functions are known are flap endonucleases that generate the 3' incision next to DNA damage as part of nucleotide excision repair. This family is related to many other flap endonuclease families including the fen1 family. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 273166 [Multi-domain]  Cd Length: 1034  Bit Score: 43.35  E-value: 1.49e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907117193  326 VIPSEDlEPEVESSTEPAPPPLSpCASESLVPIAPTAQPEELLNG-APSPPAVDLSPVSepeeqakkvSSAALASILSPA 404
Cdd:TIGR00600  520 VKPVSS-EFGLPSQREDKLAIPT-EGTQNLQGISDHPEQFEFQNElSPLETKNNESNLS---------SDAETEGSPNPE 588
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907117193  405 PPVAPSDTSPAQEEEMEEDDDDEeggeaesekGGEDV--PLDSTPVPAQLSQnlevaAATQVAVSVPKRRRKIkELNKKE 482
Cdd:TIGR00600  589 MPSWSSVTVPSEALDNYETTNPS---------NAKEVrnFAETGIQTTNVGE-----SADLLLISNPMEVEPM-ESEKEE 653
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907117193  483 AVGDllDAFKEVDPAVPEVENQPPTGSNPSPESE-------GSMVPTQPEETEEtWDSKEDKIHNAENIQPGEQKYEYKS 555
Cdd:TIGR00600  654 SESD--GSFIEVDSVSSTLELQVPSKSQPTDESEenaenkvASIEGEHRKEIED-LLFDESEEDNIVGMIEEEKDADDFK 730
                          250
                   ....*....|
gi 1907117193  556 DQWKPLNLEE 565
Cdd:TIGR00600  731 NEWQDISLEE 740
PRK08691 PRK08691
DNA polymerase III subunits gamma and tau; Validated
280-440 3.21e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236333 [Multi-domain]  Cd Length: 709  Bit Score: 42.00  E-value: 3.21e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907117193  280 PTLAEPILEVEVTLSKPIPESEFSSSPLQV-STALVPHKVETHEPNGVIPSEDLEP---EVESSTEPAPPPLSPCASESL 355
Cdd:PRK08691   380 PSAQTAEKETAAKKPQPRPEAETAQTPVQTaSAAAMPSEGKTAGPVSNQENNDVPPwedAPDEAQTAAGTAQTSAKSIQT 459
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907117193  356 VPIAPTAQPEEL-------------LNGAPSPPAVDLSPVSEPEEQAKKVSSAalasilsPAPPVA----PSDTSPAQEE 418
Cdd:PRK08691   460 ASEAETPPENQVsknkaadnetdapLSEVPSENPIQATPNDEAVETETFAHEA-------PAEPFYgygfPDNDCPPEDG 532
                          170       180
                   ....*....|....*....|..
gi 1907117193  419 EMEEDDDDEEGGEAESEKGGED 440
Cdd:PRK08691   533 AEIPPPDWEHAAPADTAGGGAD 554
Rib_recp_KP_reg pfam05104
Ribosome receptor lysine/proline rich region; This highly conserved region is found towards ...
321-415 3.74e-03

Ribosome receptor lysine/proline rich region; This highly conserved region is found towards the C-terminus of the transmembrane domain. The function is unclear.


Pssm-ID: 461548 [Multi-domain]  Cd Length: 140  Bit Score: 39.33  E-value: 3.74e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907117193  321 HEPNGVIPseDLEPEVESSTEPAPPPLSPCASESLVPIAPTAQPEELLNGAPSPPAVDlSPVSEPEEQAKKVSSAALAsi 400
Cdd:pfam05104   44 EKPNGKLP--ESEQADESEEEPREFKTPDEAPSAALEPEPVPTPVPAPVEPEPAPPSE-SPAPSPKEKKKKEKKSAKV-- 118
                           90
                   ....*....|....*
gi 1907117193  401 lSPAPPVAPSDTSPA 415
Cdd:pfam05104  119 -EPAETPEAVQPKPA 132
PRK11633 PRK11633
cell division protein DedD; Provisional
299-415 4.59e-03

cell division protein DedD; Provisional


Pssm-ID: 236940 [Multi-domain]  Cd Length: 226  Bit Score: 40.37  E-value: 4.59e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907117193  299 ESEFSSSPLqvstalVPHKVETHEPNGV---------IPSEDLEPEVESSTEPAPPPLSPCASESLVPIAPTAQPEElln 369
Cdd:PRK11633    35 QDEFAAIPL------VPKPGDRDEPDMMpaatqalptQPPEGAAEAVRAGDAAAPSLDPATVAPPNTPVEPEPAPVE--- 105
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*..
gi 1907117193  370 gAPSPPavdlsPVSEPEEQAKKVSSAALASILSPAP-PVAPSDTSPA 415
Cdd:PRK11633   106 -PPKPK-----PVEKPKPKPKPQQKVEAPPAPKPEPkPVVEEKAAPT 146
PHA03247 PHA03247
large tegument protein UL36; Provisional
2-313 5.40e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 41.85  E-value: 5.40e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907117193    2 NTPSQPrqhfyPSRAQPPSSAASRVQSAAPARPGPAPHVYPAGSQVMMIPSQISYSASQGAYYIPGQGRSTYVVPTQQYP 81
Cdd:PHA03247  2755 ARPARP-----PTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLP 2829
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907117193   82 ----VQPGAPGFYPGASPTEFgTYAGAYYPAQGVQQFPASVAPAPVL----------MNQP-------PQIAPKRERKTI 140
Cdd:PHA03247  2830 pptsAQPTAPPPPPGPPPPSL-PLGGSVAPGGDVRRRPPSRSPAAKPaaparppvrrLARPavsrsteSFALPPDQPERP 2908
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907117193  141 RIRDPNQGGKDITEEIMSGARTASTPTPPQTGGSLEPQPNGESPQVAVIIRPDDRSqGAAIGGR------------PGLP 208
Cdd:PHA03247  2909 PQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWL-GALVPGRvavprfrvpqpaPSRE 2987
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907117193  209 GPEHSPGTESQPSSPSPTPSPPPIL-----EPGSESNLGVLSIPGDtmTTGMIPMSVEESTPISCETGEPYCLSPEPTLA 283
Cdd:PHA03247  2988 APASSTPPLTGHSLSRVSSWASSLAlheetDPPPVSLKQTLWPPDD--TEDSDADSLFDSDSERSDLEALDPLPPEPHDP 3065
                          330       340       350
                   ....*....|....*....|....*....|...
gi 1907117193  284 ---EPILEVEVTLSKPIPESEFSSSPLQVSTAL 313
Cdd:PHA03247  3066 fahEPDPATPEAGARESPSSQFGPPPLSANAAL 3098
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
91-514 6.29e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 41.29  E-value: 6.29e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907117193   91 PGASPTEFGTYAGAYYPAQGVQQFPaSVAPAPVLMNQPPQIAPKRERKTIRIRDPNQGGKDITEEIMSGARTASTPTPPQ 170
Cdd:pfam03154  171 PPVLQAQSGAASPPSPPPPGTTQAA-TAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPHPPL 249
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907117193  171 TGGSLEPQPNGESPQVavIIRPDDRSQGAAiGGRPGLPGPEHSPGTESQPSSPSPTPSPPPILEPGSESNLgvlsiPGDT 250
Cdd:pfam03154  250 QPMTQPPPPSQVSPQP--LPQPSLHGQMPP-MPHSLQTGPSHMQHPVPPQPFPLTPQSSQSQVPPGPSPAA-----PGQS 321
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907117193  251 MTTGMIPMSVEESTPISCETGEPYCLSP------EPTLAEPILEVEVTLSKPIPESEFSSSPLQVSTALVPhkvethePN 324
Cdd:pfam03154  322 QQRIHTPPSQSQLQSQQPPREQPLPPAPlsmphiKPPPTTPIPQLPNPQSHKHPPHLSGPSPFQMNSNLPP-------PP 394
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907117193  325 GVIPSEDLEPEVESSTEPAPPPLSPcASESLVPiaPTAQPEELLNGAPSPPAVDLSPVSEPEEQAKKVSSAALAS-ILSP 403
Cdd:pfam03154  395 ALKPLSSLSTHHPPSAHPPPLQLMP-QSQQLPP--PPAQPPVLTQSQSLPPPAASHPPTSGLHQVPSQSPFPQHPfVPGG 471
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907117193  404 APPVAPSDTSPAQEEEMEEDDDDEEGGEAESekggedvpldSTPVPAqlsqnlevaaatQVAVSVPKRRRKIKELNKKEa 483
Cdd:pfam03154  472 PPPITPPSGPPTSTSSAMPGIQPPSSASVSS----------SGPVPA------------AVSCPLPPVQIKEEALDEAE- 528
                          410       420       430
                   ....*....|....*....|....*....|.
gi 1907117193  484 vgdlldafkevdpavpEVENQPPTGSNPSPE 514
Cdd:pfam03154  529 ----------------EPESPPPPPRSPSPE 543
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
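
Each hit above carries a summary line of the form "Pssm-ID: ...  Cd Length: ...  Bit Score: ...  E-value: ...", and the search reports an E-value threshold of 0.01. The sketch below shows one way such lines could be parsed and screened against that threshold. It is an illustration only: the names `SUMMARY_RE`, `parse_summary`, and `filter_hits` are hypothetical and not part of any NCBI tool, and treating the threshold as inclusive (E-value at or below 0.01) is an assumption.

```python
import re

# Hypothetical pattern for the per-hit summary line as it appears on this page.
# The optional bracketed flag (e.g. "[Multi-domain]") follows the Pssm-ID.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return a dict of the summary fields, or None if the line is not a summary line."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "multi_domain": m.group("flag") == "Multi-domain",
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


def filter_hits(lines, threshold=0.01):
    """Keep parsed hits whose E-value is at or below the threshold (assumed inclusive)."""
    hits = (parse_summary(line) for line in lines)
    return [h for h in hits if h is not None and h["e_value"] <= threshold]


if __name__ == "__main__":
    sample = [
        "Pssm-ID: 214713  Cd Length: 200  Bit Score: 184.10  E-value: 7.36e-53",
        "Pssm-ID: 211398 [Multi-domain]  Cd Length: 194  Bit Score: 49.52  E-value: 2.81e-06",
    ]
    for hit in filter_hits(sample):
        print(hit["pssm_id"], hit["e_value"], hit["multi_domain"])
```

Both sample lines are copied from the hit list above; alignment rows and description lines are skipped because they do not match the pattern.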

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D):384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D):265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D):200-3.