NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|568968630|ref|XP_006514220|]
View 

R3H domain-containing protein 2 isoform X1 [Mus musculus]

Protein Classification

R3H domain-containing protein( domain architecture ID 11552619)

R3H domain-containing protein with SUZ and Med15 domains, may bind ssDNA or ssRNA in a sequence-specific manner; similar to Mus musculus R3H domain-containing protein 2

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
R3H_encore_like cd02642
R3H domain of encore-like and DIP1-like proteins. Drosophila encore is involved in the ...
168-229 4.31e-27

R3H domain of encore-like and DIP1-like proteins. Drosophila encore is involved in the germline exit after four mitotic divisions, by facilitating SCF-ubiquitin-proteasome-dependent proteolysis. Maize DBF1-interactor protein 1 (DIP1) containing an R3H domain is a potential regulator of DBF1 activity in stress responses. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.


:

Pssm-ID: 100071  Cd Length: 63  Bit Score: 104.61  E-value: 4.31e-27
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 568968630  168 DRMMLLKLEQEILDFINDNNNQFKKFPQMTSYHRMLLHRVAAYFGMDHNVDQTG-KAVIINKT 229
Cdd:cd02642     1 DRLFVLKLEKDLLAFIKDSTRQSLELPPMNSYYRLLAHRVAQYYGLDHNVDNSGgKCVIVNKT 63
SUZ pfam12752
SUZ domain; The SUZ domain is a conserved RNA-binding domain found in eukaryotes and enriched ...
250-303 2.26e-12

SUZ domain; The SUZ domain is a conserved RNA-binding domain found in eukaryotes and enriched in positively charged amino acids. It was first characterized in the C.elegans protein Szy-20 where it has been shown to bind RNA and allow their localization to the centrosome. Warning- the domain has a compositionally biased character.


:

Pssm-ID: 463689 [Multi-domain]  Cd Length: 56  Bit Score: 62.72  E-value: 2.26e-12
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 568968630   250 EFQQRFILKRDDASMD--RDDNQMRVPLQDGRRSKSIEEREEEYQRVRERIFARET 303
Cdd:pfam12752    1 PPPKMKILRRPSSGSSssSSAGSSGASSSSGSDSKTLEEREAEYAEARARIFGSSE 56
Med15 super family cl26621
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ...
451-800 8.12e-12

ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators use the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.


The actual alignment was detected with superfamily member pfam09606:

Pssm-ID: 312941 [Multi-domain]  Cd Length: 732  Bit Score: 69.27  E-value: 8.12e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968630   451 LPPTPQHQPPLNNHMISQPVPALQPSPQPVQFSPSSCPQVLLPVSPPQQYNMAEDLSNPF--GQMSLSRQGSTEAADPSS 528
Cdd:pfam09606  124 LASLGRPQMPMGGAGFPSQMSRVGRMQPGGQAGGMMQPSSGQPGSGTPNQMGPNGGPGQGqaGGMNGGQQGPMGGQMPPQ 203
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968630   529 ALFQPPLISQHPQQASFIMASAGQPLPTSNYSTSSHAPPTQQVLPPQGYMQppQQIQVSYYPPGQYPNSNQQYRPLSHPV 608
Cdd:pfam09606  204 MGVPGMPGPADAGAQMGQQAQANGGMNPQQMGGAPNQVAMQQQQPQQQGQQ--SQLGMGINQMQQMPQGVGGGAGQGGPG 281
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968630   609 AYSPQRGQQLPQASQQ--PGLQPMMSNQQQTAYQGMLGVQQPQNQGLLSNQRSSMGGQMQ---GLVVQYTPLPSYQVPVG 683
Cdd:pfam09606  282 QPMGPPGQQPGAMPNVmsIGDQNNYQQQQTRQQQQQQGGNHPAAHQQQMNQSVGQGGQVValgGLNHLETWNPGNFGGLG 361
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968630   684 SDSQNVVQP---SFQQPMLVPASQSVQG--GLPTGGVPVYYSMIPPAQQNGTSPSVGFLQPPGSEQYQMPQSPSPCSPPQ 758
Cdd:pfam09606  362 ANPMQRGQPgmmSSPSPVPGQQVRQVTPnqFMRQSPQPSVPSPQGPGSQPPQSHPGGMIPSPALIPSPSPQMSQQPAQQR 441
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|...
gi 568968630   759 MSQQYSGVSPSG-PGVVVMqlNVPNGPQAPQNPSMVQWSHCKY 800
Cdd:pfam09606  442 TIGQDSPGGSLNtPGQSAV--NSPLNPQEEQLYREKYRQLTKY 482
 
Name Accession Description Interval E-value
R3H_encore_like cd02642
R3H domain of encore-like and DIP1-like proteins. Drosophila encore is involved in the ...
168-229 4.31e-27

R3H domain of encore-like and DIP1-like proteins. Drosophila encore is involved in the germline exit after four mitotic divisions, by facilitating SCF-ubiquitin-proteasome-dependent proteolysis. Maize DBF1-interactor protein 1 (DIP1) containing an R3H domain is a potential regulator of DBF1 activity in stress responses. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.


Pssm-ID: 100071  Cd Length: 63  Bit Score: 104.61  E-value: 4.31e-27
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 568968630  168 DRMMLLKLEQEILDFINDNNNQFKKFPQMTSYHRMLLHRVAAYFGMDHNVDQTG-KAVIINKT 229
Cdd:cd02642     1 DRLFVLKLEKDLLAFIKDSTRQSLELPPMNSYYRLLAHRVAQYYGLDHNVDNSGgKCVIVNKT 63
R3H smart00393
Putative single-stranded nucleic acids-binding domain;
152-229 3.03e-14

Putative single-stranded nucleic acids-binding domain;


Pssm-ID: 214647  Cd Length: 79  Bit Score: 68.87  E-value: 3.03e-14
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968630    152 IDLHEFLVNTLKKNPRDRMMLLKLEQEILDFINdNNNQFKKFPQMTSYHRMLLHRVAAYFGMDHNVDQTGKA--VIINKT 229
Cdd:smart00393    1 ADFLPVTLDALSYRPRRREELIELELEIARFVK-STKESVELPPMNSYERKIVHELAEKYGLESESFGEGPKrrVVISKK 79
SUZ pfam12752
SUZ domain; The SUZ domain is a conserved RNA-binding domain found in eukaryotes and enriched ...
250-303 2.26e-12

SUZ domain; The SUZ domain is a conserved RNA-binding domain found in eukaryotes and enriched in positively charged amino acids. It was first characterized in the C.elegans protein Szy-20 where it has been shown to bind RNA and allow their localization to the centrosome. Warning- the domain has a compositionally biased character.


Pssm-ID: 463689 [Multi-domain]  Cd Length: 56  Bit Score: 62.72  E-value: 2.26e-12
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 568968630   250 EFQQRFILKRDDASMD--RDDNQMRVPLQDGRRSKSIEEREEEYQRVRERIFARET 303
Cdd:pfam12752    1 PPPKMKILRRPSSGSSssSSAGSSGASSSSGSDSKTLEEREAEYAEARARIFGSSE 56
Med15 pfam09606
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ...
451-800 8.12e-12

ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators use the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.


Pssm-ID: 312941 [Multi-domain]  Cd Length: 732  Bit Score: 69.27  E-value: 8.12e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968630   451 LPPTPQHQPPLNNHMISQPVPALQPSPQPVQFSPSSCPQVLLPVSPPQQYNMAEDLSNPF--GQMSLSRQGSTEAADPSS 528
Cdd:pfam09606  124 LASLGRPQMPMGGAGFPSQMSRVGRMQPGGQAGGMMQPSSGQPGSGTPNQMGPNGGPGQGqaGGMNGGQQGPMGGQMPPQ 203
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968630   529 ALFQPPLISQHPQQASFIMASAGQPLPTSNYSTSSHAPPTQQVLPPQGYMQppQQIQVSYYPPGQYPNSNQQYRPLSHPV 608
Cdd:pfam09606  204 MGVPGMPGPADAGAQMGQQAQANGGMNPQQMGGAPNQVAMQQQQPQQQGQQ--SQLGMGINQMQQMPQGVGGGAGQGGPG 281
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968630   609 AYSPQRGQQLPQASQQ--PGLQPMMSNQQQTAYQGMLGVQQPQNQGLLSNQRSSMGGQMQ---GLVVQYTPLPSYQVPVG 683
Cdd:pfam09606  282 QPMGPPGQQPGAMPNVmsIGDQNNYQQQQTRQQQQQQGGNHPAAHQQQMNQSVGQGGQVValgGLNHLETWNPGNFGGLG 361
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968630   684 SDSQNVVQP---SFQQPMLVPASQSVQG--GLPTGGVPVYYSMIPPAQQNGTSPSVGFLQPPGSEQYQMPQSPSPCSPPQ 758
Cdd:pfam09606  362 ANPMQRGQPgmmSSPSPVPGQQVRQVTPnqFMRQSPQPSVPSPQGPGSQPPQSHPGGMIPSPALIPSPSPQMSQQPAQQR 441
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|...
gi 568968630   759 MSQQYSGVSPSG-PGVVVMqlNVPNGPQAPQNPSMVQWSHCKY 800
Cdd:pfam09606  442 TIGQDSPGGSLNtPGQSAV--NSPLNPQEEQLYREKYRQLTKY 482
R3H pfam01424
R3H domain; The name of the R3H domain comes from the characteriztic spacing of the most ...
170-228 2.46e-10

R3H domain; The name of the R3H domain comes from the characteriztic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to be binding ssDNA.


Pssm-ID: 460206  Cd Length: 60  Bit Score: 57.12  E-value: 2.46e-10
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 568968630   170 MMLLKLEQEILDFINDNNNQFKkFPQMTSYHRMLLHRVAAYFGMDHNV--DQTGKAVIINK 228
Cdd:pfam01424    1 EFLEQLAEKLAEFVKDTGKSLE-LPPMSSYERRIIHELAQKYGLESESegEEPNRRVVVYK 60
PHA03379 PHA03379
EBNA-3A; Provisional
469-848 2.67e-06

EBNA-3A; Provisional


Pssm-ID: 223066 [Multi-domain]  Cd Length: 935  Bit Score: 51.60  E-value: 2.67e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968630  469 PVPALQPSPQPVQFSPSSCPQVLLPVSPPQQYNMAEDLSNPFgqmSLSRQGSTEAADPSSALFQP--PLISQHPQQASFI 546
Cdd:PHA03379  447 PVHDLEPGPLHDQHSMAPCPVAQLPPGPLQDLEPGDQLPGVV---QDGRPACAPVPAPAGPIVRPweASLSQVPGVAFAP 523
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968630  547 MAS---AGQPLPTSNYSTSSHAPPtqqvLPPQGYMQPP------QQIQVSYYPPGQYPNSNQQYRPLshPVAYSPQRGQQ 617
Cdd:PHA03379  524 VMPqpmPVEPVPVPTVALERPVCP----APPLIAMQGPgetsgiVRVRERWRPAPWTPNPPRSPSQM--SVRDRLARLRA 597
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968630  618 LPQASQQP-GLQPMMSNQQQTayqgmlgvQQPQNQGLLSNQRSSMGGQMQGL--VVQYTPLPSYQVPvgSDSQNVVQPSF 694
Cdd:PHA03379  598 EAQPYQASvEVQPPQLTQVSP--------QQPMEYPLEPEQQMFPGSPFSQVadVMRAGGVPAMQPQ--YFDLPLQQPIS 667
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968630  695 QQPMLVPASQSVQGGLPTGGVPVYYSMIPPAQQ--NGTSPSVGFLQPPGS-------EQYQMPQSPSPCSPPQMSQQYSG 765
Cdd:PHA03379  668 QGAPLAPLRASMGPVPPVPATQPQYFDIPLTEPinQGASAAHFLPQQPMEgplvperWMFQGATLSQSVRPGVAQSQYFD 747
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968630  766 VSPSGPgvvvMQLNVPNGPQAPQNPSMVQWshckyysVEQRGQKPGDLYSPdGSPQANAQMGSSPVTSPTQSPAPSPVTS 845
Cdd:PHA03379  748 LPLTQP----INHGAPAAHFLHQPPMEGPW-------VPEQWMFQGAPPSQ-GTDVVQHQLDALGYVLHVLNHPGVPVSP 815

                  ...
gi 568968630  846 LSN 848
Cdd:PHA03379  816 AVN 818
PABP-1234 TIGR01628
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ...
506-675 1.79e-04

polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consists of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNA's from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed and of unknown tissue range.


Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 45.57  E-value: 1.79e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968630   506 LSNPFGQMSLSRQGSTEAADPSSALFQPPLISQHPQQASfimasAGQPLptsNYstsshaPPTQQVLPPQGYMQPPQQIQ 585
Cdd:TIGR01628  371 LQDQFMQLQPRMRQLPMGSPMGGAMGQPPYYGQGPQQQF-----NGQPL---GW------PRMSMMPTPMGPGGPLRPNG 436
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968630   586 VSYYPPGQYPNSNQQYRPLSHPVA---YSPQ-RGQQLPQASQQPGLQPMMSNQQQTAYQgMLGVQQPQnqgllsNQRSSM 661
Cdd:TIGR01628  437 LAPMNAVRAPSRNAQNAAQKPPMQpvmYPPNyQSLPLSQDLPQPQSTASQGGQNKKLAQ-VLASATPQ------MQKQVL 509
                          170
                   ....*....|....
gi 568968630   662 GGQMQGLVVQYTPL 675
Cdd:TIGR01628  510 GERLFPLVEAIEPA 523
Amelin smart00817
Ameloblastin precursor (Amelin); This family consists of several mammalian Ameloblastin ...
517-652 3.80e-03

Ameloblastin precursor (Amelin); This family consists of several mammalian Ameloblastin precursor (Amelin) proteins. Matrix proteins of tooth enamel consist mainly of amelogenin but also of non-amelogenin proteins, which, although their volumetric percentage is low, have an important role in enamel mineralisation. One of the non-amelogenin proteins is ameloblastin, also known as amelin and sheathlin. Ameloblastin (AMBN) is one of the enamel sheath proteins which is though to have a role in determining the prismatic structure of growing enamel crystals.


Pssm-ID: 214832 [Multi-domain]  Cd Length: 411  Bit Score: 41.03  E-value: 3.80e-03
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968630    517 RQGSTEAADPSSALFQPPLISQHPQQAsfimasagqPLPTSNYSTSSHAPPTQQVLPPQGYMQPPQQIQVSYYPPGQYPN 596
Cdd:smart00817   82 REHETQQYEYSLPVHPPPLPSQPSLQP---------QQPGLKPFLQPTALPTNQATPQKNGPQPPMHLGQPPLQQAELPM 152
                            90       100       110       120       130       140
                    ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 568968630    597 SNQQYRPLSHP-------VAYSPQRGQQLPQASQQPGLQPMMSNQQQTAYQGMLGVQQPQNQG 652
Cdd:smart00817  153 IPPQVAPSDKPpqtelplYDFADPQNPLLFQIAHLMSRGPMPQNKQQHLYPGLFYMSYGANQL 215
PRK06462 PRK06462
asparagine synthetase A; Reviewed
140-306 4.37e-03

asparagine synthetase A; Reviewed


Pssm-ID: 235808 [Multi-domain]  Cd Length: 335  Bit Score: 40.39  E-value: 4.37e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968630  140 RDSSQEYTDSTGIDLHEFLVNTLKKnprdrmmllKLEQEILDFINDNNNQFKKFPQMTsyhrmllHRVAAYFGMDHNVDQ 219
Cdd:PRK06462  136 EGADLDEVMDLIEDLIKYLVKELLE---------EHEDELEFFGRDLPHLKRPFKRIT-------HKEAVEILNEEGCRG 199
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968630  220 TGKAVIinktssTRIPEQRFSEHIKD--------EKNTEFQQRFILKRDDASMDRDdnqMRVPLQDGRRSkSIEEREEEY 291
Cdd:PRK06462  200 IDLEEL------GSEGEKSLSEHFEEpfwiidipKGSREFYDREDPERPGVLRNYD---LLLPEGYGEAV-SGGEREYEY 269
                         170
                  ....*....|....*
gi 568968630  292 QRVRERIfaRETGQN 306
Cdd:PRK06462  270 EEIVERI--REHGVD 282
 
Name Accession Description Interval E-value
R3H_encore_like cd02642
R3H domain of encore-like and DIP1-like proteins. Drosophila encore is involved in the ...
168-229 4.31e-27

R3H domain of encore-like and DIP1-like proteins. Drosophila encore is involved in the germline exit after four mitotic divisions, by facilitating SCF-ubiquitin-proteasome-dependent proteolysis. Maize DBF1-interactor protein 1 (DIP1) containing an R3H domain is a potential regulator of DBF1 activity in stress responses. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.


Pssm-ID: 100071  Cd Length: 63  Bit Score: 104.61  E-value: 4.31e-27
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 568968630  168 DRMMLLKLEQEILDFINDNNNQFKKFPQMTSYHRMLLHRVAAYFGMDHNVDQTG-KAVIINKT 229
Cdd:cd02642     1 DRLFVLKLEKDLLAFIKDSTRQSLELPPMNSYYRLLAHRVAQYYGLDHNVDNSGgKCVIVNKT 63
R3H smart00393
Putative single-stranded nucleic acids-binding domain;
152-229 3.03e-14

Putative single-stranded nucleic acids-binding domain;


Pssm-ID: 214647  Cd Length: 79  Bit Score: 68.87  E-value: 3.03e-14
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968630    152 IDLHEFLVNTLKKNPRDRMMLLKLEQEILDFINdNNNQFKKFPQMTSYHRMLLHRVAAYFGMDHNVDQTGKA--VIINKT 229
Cdd:smart00393    1 ADFLPVTLDALSYRPRRREELIELELEIARFVK-STKESVELPPMNSYERKIVHELAEKYGLESESFGEGPKrrVVISKK 79
SUZ pfam12752
SUZ domain; The SUZ domain is a conserved RNA-binding domain found in eukaryotes and enriched ...
250-303 2.26e-12

SUZ domain; The SUZ domain is a conserved RNA-binding domain found in eukaryotes and enriched in positively charged amino acids. It was first characterized in the C.elegans protein Szy-20 where it has been shown to bind RNA and allow their localization to the centrosome. Warning- the domain has a compositionally biased character.


Pssm-ID: 463689 [Multi-domain]  Cd Length: 56  Bit Score: 62.72  E-value: 2.26e-12
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 568968630   250 EFQQRFILKRDDASMD--RDDNQMRVPLQDGRRSKSIEEREEEYQRVRERIFARET 303
Cdd:pfam12752    1 PPPKMKILRRPSSGSSssSSAGSSGASSSSGSDSKTLEEREAEYAEARARIFGSSE 56
Med15 pfam09606
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ...
451-800 8.12e-12

ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators use the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.


Pssm-ID: 312941 [Multi-domain]  Cd Length: 732  Bit Score: 69.27  E-value: 8.12e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968630   451 LPPTPQHQPPLNNHMISQPVPALQPSPQPVQFSPSSCPQVLLPVSPPQQYNMAEDLSNPF--GQMSLSRQGSTEAADPSS 528
Cdd:pfam09606  124 LASLGRPQMPMGGAGFPSQMSRVGRMQPGGQAGGMMQPSSGQPGSGTPNQMGPNGGPGQGqaGGMNGGQQGPMGGQMPPQ 203
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968630   529 ALFQPPLISQHPQQASFIMASAGQPLPTSNYSTSSHAPPTQQVLPPQGYMQppQQIQVSYYPPGQYPNSNQQYRPLSHPV 608
Cdd:pfam09606  204 MGVPGMPGPADAGAQMGQQAQANGGMNPQQMGGAPNQVAMQQQQPQQQGQQ--SQLGMGINQMQQMPQGVGGGAGQGGPG 281
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968630   609 AYSPQRGQQLPQASQQ--PGLQPMMSNQQQTAYQGMLGVQQPQNQGLLSNQRSSMGGQMQ---GLVVQYTPLPSYQVPVG 683
Cdd:pfam09606  282 QPMGPPGQQPGAMPNVmsIGDQNNYQQQQTRQQQQQQGGNHPAAHQQQMNQSVGQGGQVValgGLNHLETWNPGNFGGLG 361
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968630   684 SDSQNVVQP---SFQQPMLVPASQSVQG--GLPTGGVPVYYSMIPPAQQNGTSPSVGFLQPPGSEQYQMPQSPSPCSPPQ 758
Cdd:pfam09606  362 ANPMQRGQPgmmSSPSPVPGQQVRQVTPnqFMRQSPQPSVPSPQGPGSQPPQSHPGGMIPSPALIPSPSPQMSQQPAQQR 441
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|...
gi 568968630   759 MSQQYSGVSPSG-PGVVVMqlNVPNGPQAPQNPSMVQWSHCKY 800
Cdd:pfam09606  442 TIGQDSPGGSLNtPGQSAV--NSPLNPQEEQLYREKYRQLTKY 482
R3H cd02325
R3H domain. The name of the R3H domain comes from the characteristic spacing of the most ...
172-228 3.86e-11

R3H domain. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. R3H domains are found in proteins together with ATPase domains, SF1 helicase domains, SF2 DEAH helicase domains, Cys-rich repeats, ring-type zinc fingers, and KH domains. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.


Pssm-ID: 100064  Cd Length: 59  Bit Score: 59.16  E-value: 3.86e-11
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 568968630  172 LLKLEQEILDFINDNNNQFKKFPQMTSYHRMLLHRVAAYFGMDHNVDQTG--KAVIINK 228
Cdd:cd02325     1 REEREEELEAFAKDAAGKSLELPPMNSYERKLIHDLAEYYGLKSESEGEGpnRRVVITK 59
R3H pfam01424
R3H domain; The name of the R3H domain comes from the characteriztic spacing of the most ...
170-228 2.46e-10

R3H domain; The name of the R3H domain comes from the characteriztic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to be binding ssDNA.


Pssm-ID: 460206  Cd Length: 60  Bit Score: 57.12  E-value: 2.46e-10
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 568968630   170 MMLLKLEQEILDFINDNNNQFKkFPQMTSYHRMLLHRVAAYFGMDHNV--DQTGKAVIINK 228
Cdd:pfam01424    1 EFLEQLAEKLAEFVKDTGKSLE-LPPMSSYERRIIHELAQKYGLESESegEEPNRRVVVYK 60
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
471-699 9.66e-08

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 56.20  E-value: 9.66e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968630   471 PALQPSPQPVQFSPSSCPQVLLPVSPPQQ-----------YNMAEDLSNPFGQMSLSRQGSTEAADPSSALFQPPLISQH 539
Cdd:pfam09770  107 PAARAAQSSAQPPASSLPQYQYASQQSQQpskpvrtgyekYKEPEPIPDLQVDASLWGVAPKKAAAPAPAPQPAAQPASL 186
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968630   540 PQQASFIM------------ASAGQPLPTSNYSTSSHAPPTQQVLPPQGYMQPPQQIQvsyyPPGQYPNSNQQYRPLSHP 607
Cdd:pfam09770  187 PAPSRKMMsleeveaamraqAKKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQ----QPQQQPQQPQQHPGQGHP 262
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968630   608 VAYspqrgQQLPQASQQPGLQPMMSNQQQTAYQGMLGV-QQP----QNQGLLSNQRSSMGGQMQGlVVQYTPLPSYQVPV 682
Cdd:pfam09770  263 VTI-----LQRPQSPQPDPAQPSIQPQAQQFHQQPPPVpVQPtqilQNPNRLSAARVGYPQNPQP-GVQPAPAHQAHRQQ 336
                          250
                   ....*....|....*..
gi 568968630   683 GSDSQNVVQPSFQQPML 699
Cdd:pfam09770  337 GSFGRQAPIITHPQQLA 353
PHA03379 PHA03379
EBNA-3A; Provisional
469-848 2.67e-06

EBNA-3A; Provisional


Pssm-ID: 223066 [Multi-domain]  Cd Length: 935  Bit Score: 51.60  E-value: 2.67e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968630  469 PVPALQPSPQPVQFSPSSCPQVLLPVSPPQQYNMAEDLSNPFgqmSLSRQGSTEAADPSSALFQP--PLISQHPQQASFI 546
Cdd:PHA03379  447 PVHDLEPGPLHDQHSMAPCPVAQLPPGPLQDLEPGDQLPGVV---QDGRPACAPVPAPAGPIVRPweASLSQVPGVAFAP 523
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968630  547 MAS---AGQPLPTSNYSTSSHAPPtqqvLPPQGYMQPP------QQIQVSYYPPGQYPNSNQQYRPLshPVAYSPQRGQQ 617
Cdd:PHA03379  524 VMPqpmPVEPVPVPTVALERPVCP----APPLIAMQGPgetsgiVRVRERWRPAPWTPNPPRSPSQM--SVRDRLARLRA 597
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968630  618 LPQASQQP-GLQPMMSNQQQTayqgmlgvQQPQNQGLLSNQRSSMGGQMQGL--VVQYTPLPSYQVPvgSDSQNVVQPSF 694
Cdd:PHA03379  598 EAQPYQASvEVQPPQLTQVSP--------QQPMEYPLEPEQQMFPGSPFSQVadVMRAGGVPAMQPQ--YFDLPLQQPIS 667
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968630  695 QQPMLVPASQSVQGGLPTGGVPVYYSMIPPAQQ--NGTSPSVGFLQPPGS-------EQYQMPQSPSPCSPPQMSQQYSG 765
Cdd:PHA03379  668 QGAPLAPLRASMGPVPPVPATQPQYFDIPLTEPinQGASAAHFLPQQPMEgplvperWMFQGATLSQSVRPGVAQSQYFD 747
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968630  766 VSPSGPgvvvMQLNVPNGPQAPQNPSMVQWshckyysVEQRGQKPGDLYSPdGSPQANAQMGSSPVTSPTQSPAPSPVTS 845
Cdd:PHA03379  748 LPLTQP----INHGAPAAHFLHQPPMEGPW-------VPEQWMFQGAPPSQ-GTDVVQHQLDALGYVLHVLNHPGVPVSP 815

                  ...
gi 568968630  846 LSN 848
Cdd:PHA03379  816 AVN 818
PRK10263 PRK10263
DNA translocase FtsK; Provisional
449-625 9.62e-06

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 50.08  E-value: 9.62e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968630  449 PALPPTPQHQPPLNNHMISQPVPALQPSPQPVQFSPSSCPQVLLPVSPPQQYNMAEDLSNPFGQMSLSRQGSTEAADPSS 528
Cdd:PRK10263  336 PVEPVTQTPPVASVDVPPAQPTVAWQPVPGPQTGEPVIAPAPEGYPQQSQYAQPAVQYNEPLQQPVQPQQPYYAPAAEQP 415
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968630  529 AlfQPPLISQHPQQASFIMASAGQPlPTSNYSTSSHAPPTQQVLPPQGYMQPPQQIQVSYYPPGQYpnsnQQYRPLSHPV 608
Cdd:PRK10263  416 A--QQPYYAPAPEQPAQQPYYAPAP-EQPVAGNAWQAEEQQSTFAPQSTYQTEQTYQQPAAQEPLY----QQPQPVEQQP 488
                         170
                  ....*....|....*..
gi 568968630  609 AYSPQRGQQLPQASQQP 625
Cdd:PRK10263  489 VVEPEPVVEETKPARPP 505
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
442-742 3.78e-05

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 47.84  E-value: 3.78e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968630   442 QQQQQQLPALPPTPQHQPPlnnhmisqPVPALQPSPQPVQFSPSSCPQVLLPVSPP--QQYNMAEDLSNPFGQMSLSRQG 519
Cdd:pfam03154  170 QPPVLQAQSGAASPPSPPP--------PGTTQAATAGPTPSAPSVPPQGSPATSQPpnQTQSTAAPHTLIQQTPTLHPQR 241
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968630   520 STEAADPSSALFQPPLISQHPQQA--SFIMASAGQPLPTSNYSTSSHAPptqQVLPPQGYMQPPQ--QIQVSYYPPGQYP 595
Cdd:pfam03154  242 LPSPHPPLQPMTQPPPPSQVSPQPlpQPSLHGQMPPMPHSLQTGPSHMQ---HPVPPQPFPLTPQssQSQVPPGPSPAAP 318
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968630   596 NSNQQ--YRPLSHPVAYSPQ--RGQQLPQASQQ------------PGLQPMMSNQQQTAYQGMLGVQQPQN--------- 650
Cdd:pfam03154  319 GQSQQriHTPPSQSQLQSQQppREQPLPPAPLSmphikpppttpiPQLPNPQSHKHPPHLSGPSPFQMNSNlppppalkp 398
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968630   651 -QGLLSNQRSSMGGQMQGLVVQYTPLPS--YQVPVGSDSQNVVQPSFQQPmlvPASQSVQGG----------LPTGGVPV 717
Cdd:pfam03154  399 lSSLSTHHPPSAHPPPLQLMPQSQQLPPppAQPPVLTQSQSLPPPAASHP---PTSGLHQVPsqspfpqhpfVPGGPPPI 475
                          330       340
                   ....*....|....*....|....*
gi 568968630   718 YYSMIPPAQqngTSPSVGFLQPPGS 742
Cdd:pfam03154  476 TPPSGPPTS---TSSAMPGIQPPSS 497
PHA03247 PHA03247
large tegument protein UL36; Provisional
357-619 6.30e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 47.24  E-value: 6.30e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968630  357 PRPWSS----TDSDGSVRSMRPPVTKASSfsgisilTRGDSIGSSKGGSAGRLSRPGMALGAPEVC---NQVTSPQSVRG 429
Cdd:PHA03247 2712 PHALVSatplPPGPAAARQASPALPAAPA-------PPAVPAGPATPGGPARPARPPTTAGPPAPAppaAPAAGPPRRLT 2784
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968630  430 LLPCTAQQQQQQQQQQQQLPALPPTPQHQPPLNNHMISQPVPALQPSPQPVQFSPSSCPQvllPVSPPQQYNMAEDLSNP 509
Cdd:PHA03247 2785 RPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPG---PPPPSLPLGGSVAPGGD 2861
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968630  510 FGQMSLSRQGSTEAADPS----SALFQPPLisqHPQQASFIMASAGQPLPTSnySTSSHAPPTQQVLPPQGYMQPPQQIQ 585
Cdd:PHA03247 2862 VRRRPPSRSPAAKPAAPArppvRRLARPAV---SRSTESFALPPDQPERPPQ--PQAPPPPQPQPQPPPPPQPQPPPPPP 2936
                         250       260       270
                  ....*....|....*....|....*....|....
gi 568968630  586 VSYYPPGQyPNSNQQYRPLSHPVAYSPQRGQQLP 619
Cdd:PHA03247 2937 PRPQPPLA-PTTDPAGAGEPSGAVPQPWLGALVP 2969
PRK10263 PRK10263
DNA translocase FtsK; Provisional
526-677 8.12e-05

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 47.00  E-value: 8.12e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968630  526 PSSALFQPPLI-SQHPQQasfimasagQPLPTSNYSTSSH-APPTQQVLPPQGYMQPPQQiqvsYYPPGQYPNSNQQYRP 603
Cdd:PRK10263  740 PHEPLFTPIVEpVQQPQQ---------PVAPQQQYQQPQQpVAPQPQYQQPQQPVAPQPQ----YQQPQQPVAPQPQYQQ 806
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 568968630  604 LSHPVAYSPQRGQ-QLPQASQQPGLQPmmsnQQQTAyqgmlgvqqPQNQGLLSNQRSSMGGQMQGLVVQYTPLPS 677
Cdd:PRK10263  807 PQQPVAPQPQYQQpQQPVAPQPQYQQP----QQPVA---------PQPQDTLLHPLLMRNGDSRPLHKPTTPLPS 868
PABP-1234 TIGR01628
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ...
506-675 1.79e-04

polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consists of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNA's from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed and of unknown tissue range.


Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 45.57  E-value: 1.79e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968630   506 LSNPFGQMSLSRQGSTEAADPSSALFQPPLISQHPQQASfimasAGQPLptsNYstsshaPPTQQVLPPQGYMQPPQQIQ 585
Cdd:TIGR01628  371 LQDQFMQLQPRMRQLPMGSPMGGAMGQPPYYGQGPQQQF-----NGQPL---GW------PRMSMMPTPMGPGGPLRPNG 436
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968630   586 VSYYPPGQYPNSNQQYRPLSHPVA---YSPQ-RGQQLPQASQQPGLQPMMSNQQQTAYQgMLGVQQPQnqgllsNQRSSM 661
Cdd:TIGR01628  437 LAPMNAVRAPSRNAQNAAQKPPMQpvmYPPNyQSLPLSQDLPQPQSTASQGGQNKKLAQ-VLASATPQ------MQKQVL 509
                          170
                   ....*....|....
gi 568968630   662 GGQMQGLVVQYTPL 675
Cdd:TIGR01628  510 GERLFPLVEAIEPA 523
PHA03377 PHA03377
EBNA-3C; Provisional
449-736 7.43e-04

EBNA-3C; Provisional


Pssm-ID: 177614 [Multi-domain]  Cd Length: 1000  Bit Score: 43.50  E-value: 7.43e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968630  449 PALPPTPQHQPPLNNHMISQPVPALQpsPQPVQFSPSScpqvllPVSPPQQYNMAEDLS--NPFGQMSLSRQGSTEAADP 526
Cdd:PHA03377  672 PATQSTPPRPSWLPSVFVLPSVDAGR--AQPSEESHLS------SMSPTQPISHEEQPRyeDPDDPLDLSLHPDQAPPPS 743
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968630  527 SSALFQPPLISQHPQqasfimasagqplptSNYSTSSHAPPTQqvLPPQGYMQPP-QQIQVSYYPPGQYP-NSNQQYRPL 604
Cdd:PHA03377  744 HQAPYSGHEEPQAQQ---------------APYPGYWEPRPPQ--APYLGYQEPQaQGVQVSSYPGYAGPwGLRAQHPRY 806
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968630  605 SHPVAYSPQR----GQQLPQASQQPGLQPmmsnqqqtAYQGMLGVQQPQNQGLLSNQRSSMGGQMQGLVVQYTPLPsyQV 680
Cdd:PHA03377  807 RHSWAYWSQYpghgHPQGPWAPRPPHLPP--------QWDGSAGHGQDQVSQFPHLQSETGPPRLQLSQVPQLPYS--QT 876
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 568968630  681 PVGSDSQNVVQPSFQQPMlvpasQSVQGGLPTGGVPVYYSMIPPAQQNGTS-PSVGF 736
Cdd:PHA03377  877 LVSSSAPSWSSPQPRAPI-----RPIPTRFPPPPMPLQDSMAVGCDSSGTAcPSMPF 928
G_path_suppress pfam15991
G-protein pathway suppressor; This family of proteins inhibits G-protein- and ...
472-631 1.22e-03

G-protein pathway suppressor; This family of proteins inhibits G-protein- and mitogen-activated protein kinase-mediated signal transduction.


Pssm-ID: 464961 [Multi-domain]  Cd Length: 272  Bit Score: 41.83  E-value: 1.22e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968630   472 ALQPSPQPVQFSPSS-CPQVLLPVSPPQQYNMAEDLSNPFGQMSLSRQGSTEAADPSSALFQPPLISQHPQQASFIMASA 550
Cdd:pfam15991  103 ALQQAAAQVFLPQLSmQGQPHHQQHPGPQVGVLKRTRSPSPPVQQQAYYKQPAFSPGYAEHGQQKHDDGRRGYDVARFGS 182
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968630   551 GQPLPTSNYSTSSHAPPTQQVLPP-QGYMQPPQQIQVSYYPPGQYPNSNQQYRPLSHPVAY---SPQRGQQLPQA--SQQ 624
Cdd:pfam15991  183 WNKSTAQYPPSGQLFYPTHQYLPPpQTQGQADARLQTIYPQPGYALPLQQQYEHANQPSPFvssSPLKQMQSPKAgpGPQ 262

                   ....*..
gi 568968630   625 PGLQPMM 631
Cdd:pfam15991  263 PMQLSVL 269
PRK10263 PRK10263
DNA translocase FtsK; Provisional
520-719 1.64e-03

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 42.76  E-value: 1.64e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968630  520 STEAADPSSALFQPPLISQHPQqasfimaSAGQPLPTSNYSTSSHAPPtqqvlpPQGYMQPPQ--QIQVSYYPPGQYPNS 597
Cdd:PRK10263  336 PVEPVTQTPPVASVDVPPAQPT-------VAWQPVPGPQTGEPVIAPA------PEGYPQQSQyaQPAVQYNEPLQQPVQ 402
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968630  598 NQQyrPLSHPVAYSPQRGQQLPQASQQPGLQPMMSNQQQTAYQGMLGVQQPQNQgLLSNQRSsmggqmqglvvqYTPLPS 677
Cdd:PRK10263  403 PQQ--PYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQS-TFAPQST------------YQTEQT 467
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|..
gi 568968630  678 YQVPVGSDSQNVVQPSFQQPMLVPASQSVQGGLPTGGvPVYY 719
Cdd:PRK10263  468 YQQPAAQEPLYQQPQPVEQQPVVEPEPVVEETKPARP-PLYY 508
PRK10263 PRK10263
DNA translocase FtsK; Provisional
356-743 3.09e-03

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 41.61  E-value: 3.09e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968630  356 EPRPWSSTDSDGSVRSMrPPVTKASSfsgISILTRGDSIGSSKGGSAGRLSRPGMAL---GAPEvcnqvtsPQSVRGLLP 432
Cdd:PRK10263  536 EPEPIKSSLKAPSVAAV-PPVEAAAA---VSPLASGVKKATLATGAAATVAAPVFSLansGGPR-------PQVKEGIGP 604
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968630  433 ctaqqqqqqqqqqqqlpALPPTPQHQPPLNNHMISQPV--PALQPSPQPVQFSPSSCPQVLLPVSPPQQYNMAED-LSNP 509
Cdd:PRK10263  605 -----------------QLPRPKRIRVPTRRELASYGIklPSQRAAEEKAREAQRNQYDSGDQYNDDEIDAMQQDeLARQ 667
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968630  510 FGQMSLSRQGST-------EAADPSSALfQPPLISQ--HPQQASFimaSAGQPLPTSNYSTSSHAPPTQQVLPPQGYMQP 580
Cdd:PRK10263  668 FAQTQQQRYGEQyqhdvpvNAEDADAAA-EAELARQfaQTQQQRY---SGEQPAGANPFSLDDFEFSPMKALLDDGPHEP 743
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968630  581 pqqiqvsYYPPGQYPNSNQQYRPLSHPVAYSPQRgqqlPQASQQPGLQPMMSNQQQTAYQGMLGVQQPQNQGLLSNQrss 660
Cdd:PRK10263  744 -------LFTPIVEPVQQPQQPVAPQQQYQQPQQ----PVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQ--- 809
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968630  661 mggqmqglvvQYTPLPSYQVPvgsDSQNVVQPSFQQPMLVPASQSVQgglptggvpvyySMIPP-AQQNGTS-------- 731
Cdd:PRK10263  810 ----------PVAPQPQYQQP---QQPVAPQPQYQQPQQPVAPQPQD------------TLLHPlLMRNGDSrplhkptt 864
                         410
                  ....*....|....
gi 568968630  732 --PSVGFLQPPGSE 743
Cdd:PRK10263  865 plPSLDLLTPPPSE 878
Amelin smart00817
Ameloblastin precursor (Amelin); This family consists of several mammalian Ameloblastin ...
517-652 3.80e-03

Ameloblastin precursor (Amelin); This family consists of several mammalian Ameloblastin precursor (Amelin) proteins. Matrix proteins of tooth enamel consist mainly of amelogenin but also of non-amelogenin proteins, which, although their volumetric percentage is low, have an important role in enamel mineralisation. One of the non-amelogenin proteins is ameloblastin, also known as amelin and sheathlin. Ameloblastin (AMBN) is one of the enamel sheath proteins which is though to have a role in determining the prismatic structure of growing enamel crystals.


Pssm-ID: 214832 [Multi-domain]  Cd Length: 411  Bit Score: 41.03  E-value: 3.80e-03
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968630    517 RQGSTEAADPSSALFQPPLISQHPQQAsfimasagqPLPTSNYSTSSHAPPTQQVLPPQGYMQPPQQIQVSYYPPGQYPN 596
Cdd:smart00817   82 REHETQQYEYSLPVHPPPLPSQPSLQP---------QQPGLKPFLQPTALPTNQATPQKNGPQPPMHLGQPPLQQAELPM 152
                            90       100       110       120       130       140
                    ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 568968630    597 SNQQYRPLSHP-------VAYSPQRGQQLPQASQQPGLQPMMSNQQQTAYQGMLGVQQPQNQG 652
Cdd:smart00817  153 IPPQVAPSDKPpqtelplYDFADPQNPLLFQIAHLMSRGPMPQNKQQHLYPGLFYMSYGANQL 215
PABP-1234 TIGR01628
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ...
613-743 4.10e-03

polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consists of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNA's from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed and of unknown tissue range.


Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 40.95  E-value: 4.10e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968630   613 QRGQQLPQASQQPGLQPMMSNQQQTAYQGMLGVQQ----PQNQGLLSNQRSSMGGQMQGLVVQYTPLPSYQVPvGSDSQN 688
Cdd:TIGR01628  374 QFMQLQPRMRQLPMGSPMGGAMGQPPYYGQGPQQQfngqPLGWPRMSMMPTPMGPGGPLRPNGLAPMNAVRAP-SRNAQN 452
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 568968630   689 VVQPSFQQPMlvPASQSVQGGLPTGGVPVYYSMIPPAQQNGTSPSVGFLQPPGSE 743
Cdd:TIGR01628  453 AAQKPPMQPV--MYPPNYQSLPLSQDLPQPQSTASQGGQNKKLAQVLASATPQMQ 505
PRK06462 PRK06462
asparagine synthetase A; Reviewed
140-306 4.37e-03

asparagine synthetase A; Reviewed


Pssm-ID: 235808 [Multi-domain]  Cd Length: 335  Bit Score: 40.39  E-value: 4.37e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968630  140 RDSSQEYTDSTGIDLHEFLVNTLKKnprdrmmllKLEQEILDFINDNNNQFKKFPQMTsyhrmllHRVAAYFGMDHNVDQ 219
Cdd:PRK06462  136 EGADLDEVMDLIEDLIKYLVKELLE---------EHEDELEFFGRDLPHLKRPFKRIT-------HKEAVEILNEEGCRG 199
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968630  220 TGKAVIinktssTRIPEQRFSEHIKD--------EKNTEFQQRFILKRDDASMDRDdnqMRVPLQDGRRSkSIEEREEEY 291
Cdd:PRK06462  200 IDLEEL------GSEGEKSLSEHFEEpfwiidipKGSREFYDREDPERPGVLRNYD---LLLPEGYGEAV-SGGEREYEY 269
                         170
                  ....*....|....*
gi 568968630  292 QRVRERIfaRETGQN 306
Cdd:PRK06462  270 EEIVERI--REHGVD 282
PRK14948 PRK14948
DNA polymerase III subunit gamma/tau;
412-625 5.70e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237862 [Multi-domain]  Cd Length: 620  Bit Score: 40.72  E-value: 5.70e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968630  412 LGAPEVCNQVTSPQSVRGLLPCTAQQQQQQQQQQQQLPALPPTPQHQPPLNN-HMISQPVPALQPSPQPVQfspsscpqv 490
Cdd:PRK14948  360 LPSAFISEIANASAPANPTPAPNPSPPPAPIQPSAPKTKQAATTPSPPPAKAsPPIPVPAEPTEPSPTPPA--------- 430
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968630  491 lLPVSPPQQYNMAE-------DLSNPFGQMSLSRQGSTEAADPSSALFQ------------PPLIS-------QHPQQAS 544
Cdd:PRK14948  431 -NAANAPPSLNLEElwqqilaKLELPSTRMLLSQQAELVSLDSNRAVIAvspnwlgmvqsrKPLLEqafakvlGRSIKLN 509
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968630  545 FIMASAGQPlptsnySTSSHAPPTQQVLPPQGYMQPPQQIQVSYYPPGQYPNSNQQYRPLSHPVAYSPQRGQQLPQASQQ 624
Cdd:PRK14948  510 LESQSGSAS------NTAKTPPPPQKSPPPPAPTPPLPQPTATAPPPTPPPPPPTATQASSNAPAQIPADSSPPPPIPEE 583

                  .
gi 568968630  625 P 625
Cdd:PRK14948  584 P 584
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
512-740 6.14e-03

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 40.79  E-value: 6.14e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968630   512 QMSLSRQGSTEAADPSSALfQPPLISQHPQQASFIMASAGQPLPTS--NYSTSSHAPPTQ--QVL---PPQGYMQPPQQI 584
Cdd:pfam09770   99 QVRFNRQQPAARAAQSSAQ-PPASSLPQYQYASQQSQQPSKPVRTGyeKYKEPEPIPDLQvdASLwgvAPKKAAAPAPAP 177
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968630   585 QVSYYPPGQYPNSNQ---------QYRplshpvAYSPQRGQQLPQASQQP--GLQPMMSNQQQTAYQGMLGVQQPQNQGL 653
Cdd:pfam09770  178 QPAAQPASLPAPSRKmmsleeveaAMR------AQAKKPAQQPAPAPAQPpaAPPAQQAQQQQQFPPQIQQQQQPQQQPQ 251
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968630   654 LSNQRSSMGGQMQGL---------VVQYTPLPSYQVPVGSDSQNVVQPS--FQQPMLVPASQSVQGGLPTGGVPVyysmi 722
Cdd:pfam09770  252 QPQQHPGQGHPVTILqrpqspqpdPAQPSIQPQAQQFHQQPPPVPVQPTqiLQNPNRLSAARVGYPQNPQPGVQP----- 326
                          250
                   ....*....|....*...
gi 568968630   723 PPAQQNGTSPSVGFLQPP 740
Cdd:pfam09770  327 APAHQAHRQQGSFGRQAP 344
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
548-868 7.32e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 40.52  E-value: 7.32e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968630   548 ASAGQPLPTSNYSTSShAPPTQQVLPPQgymQPPQQIQVSYYPPGQYPNSNQQYRPLSHPVAYSPQRGQQLPQASQQPGL 627
Cdd:pfam03154  144 TSPSIPSPQDNESDSD-SSAQQQILQTQ---PPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPN 219
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968630   628 QPMMSNQQQTAYQGMLGVQQPQnqglLSNQRSSMggqmqglvvqyTPLPSYQVPVGSDSQNVVQPSFQQPMlVPASQSVQ 707
Cdd:pfam03154  220 QTQSTAAPHTLIQQTPTLHPQR----LPSPHPPL-----------QPMTQPPPPSQVSPQPLPQPSLHGQM-PPMPHSLQ 283
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968630   708 GGLPTGGVPVYYSMIPPAQQNGTSPSVGFLQPPGSEQYQMPQSPSPCSPPQMSQQYSGVSPSGPGVVVM-QLNVPNGPQA 786
Cdd:pfam03154  284 TGPSHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMpHIKPPPTTPI 363
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968630   787 PQNPSMVQWSHCKYYSVEQRGQKPGDLYSPDG-----------SPQAN----------AQMGSSPVTSP--TQSPAPSPV 843
Cdd:pfam03154  364 PQLPNPQSHKHPPHLSGPSPFQMNSNLPPPPAlkplsslsthhPPSAHppplqlmpqsQQLPPPPAQPPvlTQSQSLPPP 443
                          330       340
                   ....*....|....*....|....*....
gi 568968630   844 TSLSNVCTGLSPLPVLTPFPR----PGGP 868
Cdd:pfam03154  444 AASHPPTSGLHQVPSQSPFPQhpfvPGGP 472
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH