NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1034628471|ref|XP_016884138|]
View 

RNA-binding protein EWS isoform X15 [Homo sapiens]

Protein Classification

RNA-binding protein( domain architecture ID 10350435)

RNA-binding protein similar to Arabidopsis thaliana serine/arginine-rich SC35-like splicing factor SCL30A involved in intron recognition and spliceosome assembly; contains an RNA recognition motif (RRM)

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
RRM_SF super family cl17169
RNA recognition motif (RRM) superfamily; RRM, also known as RBD (RNA binding domain) or RNP ...
362-431 3.57e-42

RNA recognition motif (RRM) superfamily; RRM, also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).


The actual alignment was detected with superfamily member cd12533:

Pssm-ID: 473069 [Multi-domain]  Cd Length: 84  Bit Score: 146.91  E-value: 3.57e-42
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034628471 362 SAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYEDPPTAKAAVEWF 431
Cdd:cd12533     1 STIYVQGLNENVTLEELADFFKHCGVVKINKRTGQPMINIYTDKETGKPKGDATVSYEDPPAAKAAVEWF 70
zf-RanBP pfam00641
Zn-finger in Ran binding protein and others;
478-520 3.49e-08

Zn-finger in Ran binding protein and others;


:

Pssm-ID: 395516 [Multi-domain]  Cd Length: 30  Bit Score: 49.27  E-value: 3.49e-08
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|...
gi 1034628471 478 RAGDWQCPnpsigdfccdvivcrGCGNQNFAWRTECNQCKAPK 520
Cdd:pfam00641   1 REGDWDCS---------------KCLVQNFATSTKCVACQAPK 28
PHA03247 super family cl33720
large tegument protein UL36; Provisional
72-207 8.72e-05

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 46.08  E-value: 8.72e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034628471   72 GQPPTGYTTPTAPQA--YSQPVQGYGTGAYDTTTATVTTTQASYAAQSAYGTQPAYPAYGQQPAATAPTRPQDGNKPTET 149
Cdd:PHA03247  2869 RSPAAKPAAPARPPVrrLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTD 2948
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1034628471  150 SQPQSS-TGGYNQPSLGY---GQSNYSYPQVPGSYPMQPVTAPPSYPPTSYSSTQPTSYDQS 207
Cdd:PHA03247  2949 PAGAGEpSGAVPQPWLGAlvpGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSRVSSWASS 3010
 
Name Accession Description Interval E-value
RRM_EWS cd12533
RNA recognition motif (RRM) found in vertebrate Ewing Sarcoma Protein (EWS); This subgroup ...
362-431 3.57e-42

RNA recognition motif (RRM) found in vertebrate Ewing Sarcoma Protein (EWS); This subgroup corresponds to the RRM of EWS, also termed Ewing sarcoma breakpoint region 1 protein, a member of the FET (previously TET) (FUS/TLS, EWS, TAF15) family of RNA- and DNA-binding proteins whose expression is altered in cancer. It is a multifunctional protein and may play roles in transcription and RNA processing. EWS is involved in transcriptional regulation by interacting with the preinitiation complex TFIID and the RNA polymerase II (RNAPII) complexes. It is also associated with splicing factors, such as the U1 snRNP protein U1C, suggesting its implication in pre-mRNA splicing. Additionally, EWS has been shown to regulate DNA damage-induced alternative splicing (AS). Like other members in the FET family, EWS contains an N-terminal Ser, Gly, Gln and Tyr-rich region composed of multiple copies of a degenerate hexapeptide repeat motif. The C-terminal region consists of a conserved nuclear import and retention signal (C-NLS), a C2/C2 zinc-finger motif, a conserved RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain), and at least 1 arginine-glycine-glycine (RGG)-repeat region. EWS specifically binds to poly G and poly U RNA. It also binds to the proximal-element DNA of the macrophage-specific promoter of the CSF-1 receptor gene.


Pssm-ID: 409950 [Multi-domain]  Cd Length: 84  Bit Score: 146.91  E-value: 3.57e-42
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034628471 362 SAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYEDPPTAKAAVEWF 431
Cdd:cd12533     1 STIYVQGLNENVTLEELADFFKHCGVVKINKRTGQPMINIYTDKETGKPKGDATVSYEDPPAAKAAVEWF 70
RRM smart00360
RNA recognition motif;
364-429 1.26e-11

RNA recognition motif;


Pssm-ID: 214636 [Multi-domain]  Cd Length: 73  Bit Score: 60.30  E-value: 1.26e-11
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1034628471  364 IYVQGLNDSVTLDDLADFFKQCGVVKMnkrtgqpmIHIYLDKETGKPKGDATVSYEDPPTAKAAVE 429
Cdd:smart00360   2 LFVGNLPPDTTEEELRELFSKFGKVES--------VRLVRDKETGKSKGFAFVEFESEEDAEKALE 59
zf-RanBP pfam00641
Zn-finger in Ran binding protein and others;
478-520 3.49e-08

Zn-finger in Ran binding protein and others;


Pssm-ID: 395516 [Multi-domain]  Cd Length: 30  Bit Score: 49.27  E-value: 3.49e-08
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|...
gi 1034628471 478 RAGDWQCPnpsigdfccdvivcrGCGNQNFAWRTECNQCKAPK 520
Cdd:pfam00641   1 REGDWDCS---------------KCLVQNFATSTKCVACQAPK 28
RRM_1 pfam00076
RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain); The RRM motif is probably diagnostic ...
364-429 5.50e-08

RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain); The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternative splicing, and protein components of snRNPs. The motif also appears in a few single stranded DNA binding proteins. The RRM structure consists of four strands and two helices arranged in an alpha/beta sandwich, with a third helix present during RNA binding in some cases The C-terminal beta strand (4th strand) and final helix are hard to align and have been omitted in the SEED alignment The LA proteins have an N terminal rrm which is included in the seed. There is a second region towards the C terminus that has some features characteriztic of a rrm but does not appear to have the important structural core of a rrm. The LA proteins are one of the main autoantigens in Systemic lupus erythematosus (SLE), an autoimmune disease.


Pssm-ID: 425453 [Multi-domain]  Cd Length: 70  Bit Score: 49.92  E-value: 5.50e-08
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1034628471 364 IYVQGLNDSVTLDDLADFFKQCGVVKmnkrtgqpmiHIYL-DKETGKPKGDATVSYEDPPTAKAAVE 429
Cdd:pfam00076   1 LFVGNLPPDTTEEDLKDLFSKFGPIK----------SIRLvRDETGRSKGFAFVEFEDEEDAEKAIE 57
RRM COG0724
RNA recognition motif (RRM) domain [Translation, ribosomal structure and biogenesis];
364-429 1.05e-07

RNA recognition motif (RRM) domain [Translation, ribosomal structure and biogenesis];


Pssm-ID: 440488 [Multi-domain]  Cd Length: 85  Bit Score: 49.71  E-value: 1.05e-07
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1034628471 364 IYVQGLNDSVTLDDLADFFKQCGVVKMnkrtgqpmIHIYLDKETGKPKGDATVSYEDPPTAKAAVE 429
Cdd:COG0724     4 IYVGNLPYSVTEEDLRELFSEYGEVTS--------VKLITDRETGRSRGFGFVEMPDDEEAQAAIE 61
ZnF_RBZ smart00547
Zinc finger domain; Zinc finger domain in Ran-binding proteins (RanBPs), and other proteins. ...
480-519 2.08e-05

Zinc finger domain; Zinc finger domain in Ran-binding proteins (RanBPs), and other proteins. In RanBPs, this domain binds RanGDP.


Pssm-ID: 197784 [Multi-domain]  Cd Length: 25  Bit Score: 41.53  E-value: 2.08e-05
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|
gi 1034628471  480 GDWQCPnpsigdfccdvivcrGCGNQNFAWRTECNQCKAP 519
Cdd:smart00547   1 GDWECP---------------ACTFLNFASRSKCFACGAP 25
PHA03247 PHA03247
large tegument protein UL36; Provisional
72-207 8.72e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 46.08  E-value: 8.72e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034628471   72 GQPPTGYTTPTAPQA--YSQPVQGYGTGAYDTTTATVTTTQASYAAQSAYGTQPAYPAYGQQPAATAPTRPQDGNKPTET 149
Cdd:PHA03247  2869 RSPAAKPAAPARPPVrrLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTD 2948
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1034628471  150 SQPQSS-TGGYNQPSLGY---GQSNYSYPQVPGSYPMQPVTAPPSYPPTSYSSTQPTSYDQS 207
Cdd:PHA03247  2949 PAGAGEpSGAVPQPWLGAlvpGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSRVSSWASS 3010
PABP-1234 TIGR01628
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ...
120-213 2.52e-03

polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consists of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNA's from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed and of unknown tissue range.


Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 40.95  E-value: 2.52e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034628471 120 GTQPAYPAYGQQPAATAPTRPQ--DGNKPTETSQ-PQSSTG--GYNQPSLGYGQSNySYPQVPGSYPMQPVTAPPSYppT 194
Cdd:TIGR01628 393 GAMGQPPYYGQGPQQQFNGQPLgwPRMSMMPTPMgPGGPLRpnGLAPMNAVRAPSR-NAQNAAQKPPMQPVMYPPNY--Q 469
                          90
                  ....*....|....*....
gi 1034628471 195 SYSSTQPTSYDQSSYSQQN 213
Cdd:TIGR01628 470 SLPLSQDLPQPQSTASQGG 488
PABP-1234 TIGR01628
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ...
364-428 3.41e-03

polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consists of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNA's from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed and of unknown tissue range.


Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 40.56  E-value: 3.41e-03
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1034628471 364 IYVQGLNDSVTLDDLADFFKQCGVVKMNKrtgqpmihIYLDkETGKPKGDATVSYEDPPTAKAAV 428
Cdd:TIGR01628 288 LYVKNLDDTVTDEKLRELFSECGEITSAK--------VMLD-EKGVSRGFGFVCFSNPEEANRAV 343
 
Name Accession Description Interval E-value
RRM_EWS cd12533
RNA recognition motif (RRM) found in vertebrate Ewing Sarcoma Protein (EWS); This subgroup ...
362-431 3.57e-42

RNA recognition motif (RRM) found in vertebrate Ewing Sarcoma Protein (EWS); This subgroup corresponds to the RRM of EWS, also termed Ewing sarcoma breakpoint region 1 protein, a member of the FET (previously TET) (FUS/TLS, EWS, TAF15) family of RNA- and DNA-binding proteins whose expression is altered in cancer. It is a multifunctional protein and may play roles in transcription and RNA processing. EWS is involved in transcriptional regulation by interacting with the preinitiation complex TFIID and the RNA polymerase II (RNAPII) complexes. It is also associated with splicing factors, such as the U1 snRNP protein U1C, suggesting its implication in pre-mRNA splicing. Additionally, EWS has been shown to regulate DNA damage-induced alternative splicing (AS). Like other members in the FET family, EWS contains an N-terminal Ser, Gly, Gln and Tyr-rich region composed of multiple copies of a degenerate hexapeptide repeat motif. The C-terminal region consists of a conserved nuclear import and retention signal (C-NLS), a C2/C2 zinc-finger motif, a conserved RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain), and at least 1 arginine-glycine-glycine (RGG)-repeat region. EWS specifically binds to poly G and poly U RNA. It also binds to the proximal-element DNA of the macrophage-specific promoter of the CSF-1 receptor gene.


Pssm-ID: 409950 [Multi-domain]  Cd Length: 84  Bit Score: 146.91  E-value: 3.57e-42
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034628471 362 SAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYEDPPTAKAAVEWF 431
Cdd:cd12533     1 STIYVQGLNENVTLEELADFFKHCGVVKINKRTGQPMINIYTDKETGKPKGDATVSYEDPPAAKAAVEWF 70
RRM_FET cd12280
RNA recognition motif (RRM) found in the FET family of RNA-binding proteins; This subfamily ...
364-431 2.05e-35

RNA recognition motif (RRM) found in the FET family of RNA-binding proteins; This subfamily corresponds to the RRM of FET (previously TET) (FUS/TLS, EWS, TAF15) family of RNA-binding proteins. This ubiquitously expressed family of similarly structured proteins predominantly localizing to the nuclear, includes FUS (also known as TLS or Pigpen or hnRNP P2), EWS (also known as EWSR1), TAF15 (also known as hTAFII68 or TAF2N or RPB56), and Drosophila Cabeza (also known as SARFH). The corresponding coding genes of these proteins are involved in deleterious genomic rearrangements with transcription factor genes in a variety of human sarcomas and acute leukemias. All FET proteins interact with each other and are therefore likely to be part of the very same protein complexes, which suggests a general bridging role for FET proteins coupling RNA transcription, processing, transport, and DNA repair. The FET proteins contain multiple copies of a degenerate hexapeptide repeat motif at the N-terminus. The C-terminal region consists of a conserved nuclear import and retention signal (C-NLS), a putative zinc-finger domain, and a conserved RNA recognition motif (RRM), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), which is flanked by 3 arginine-glycine-glycine (RGG) boxes. FUS and EWS might have similar sequence specificity; both bind preferentially to GGUG-containing RNAs. FUS has also been shown to bind strongly to human telomeric RNA and to small low-copy-number RNAs tethered to the promoter of cyclin D1. To date, nothing is known about the RNA binding specificity of TAF15.


Pssm-ID: 409722 [Multi-domain]  Cd Length: 82  Bit Score: 128.30  E-value: 2.05e-35
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1034628471 364 IYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYEDPPTAKAAVEWF 431
Cdd:cd12280     1 IFVSGLPPDVTIDELADLFGQIGIIKRYKDTWPPKIKIYTDKETGKPKGEATLTYEDPSAAKAAIEWF 68
RRM_FUS_TAF15 cd12535
RNA recognition motif (RRM) found in vertebrate fused in Ewing's sarcoma protein (FUS), ...
360-431 2.59e-32

RNA recognition motif (RRM) found in vertebrate fused in Ewing's sarcoma protein (FUS), TATA-binding protein-associated factor 15 (TAF15) and similar proteins; This subgroup corresponds to the RRM of FUS and TAF15. FUS (TLS or Pigpen or hnRNP P2), also termed 75 kDa DNA-pairing protein (POMp75), or oncoprotein TLS (Translocated in liposarcoma), is a member of the FET (previously TET) (FUS/TLS, EWS, TAF15) family of RNA- and DNA-binding proteins whose expression is altered in cancer. It is a multi-functional protein and has been implicated in pre-mRNA splicing, chromosome stability, cell spreading, and transcription. FUS was originally identified in human myxoid and round cell liposarcomas as an oncogenic fusion with the stress-induced DNA-binding transcription factor CHOP (CCAAT enhancer-binding homologous protein) and later as hnRNP P2, a component of hnRNP H complex assembled on pre-mRNA. It can form ternary complexes with hnRNP A1 and hnRNP C1/C2. Additional research indicates that FUS binds preferentially to GGUG-containing RNAs. In the presence of Mg2+, it can bind both single- and double-stranded DNA (ssDNA/dsDNA) and promote ATP-independent annealing of complementary ssDNA and D-loop formation in superhelical dsDNA. FUS has been shown to be recruited by single stranded noncoding RNAs to the regulatory regions of target genes such as cyclin D1, where it represses transcription by disrupting complex formation. TAF15 (TAFII68), also termed TATA-binding protein-associated factor 2N (TAF2N), or RNA-binding protein 56 (RBP56), originally identified as a TAF in the general transcription initiation TFIID complex, is a novel RNA/ssDNA-binding protein with homology to the proto-oncoproteins FUS and EWS (also termed EWSR1), belonging to the FET family as well. TAF15 likely functions in RNA polymerase II (RNAP II) transcription by interacting with TFIID and subunits of RNAP II itself. TAF15 is also associated with U1 snRNA, chromatin and RNA, in a complex distinct from the Sm-containing U1 snRNP that functions in splicing. Like other members in the FET family, both FUS and TAF15 contain an N-terminal Ser, Gly, Gln and Tyr-rich region composed of multiple copies of a degenerate hexapeptide repeat motif. The C-terminal region consists of a conserved nuclear import and retention signal (C-NLS), a C2/C2 zinc-finger motif, a conserved RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain), and at least 1 arginine-glycine-glycine (RGG)-repeat region.


Pssm-ID: 409951 [Multi-domain]  Cd Length: 86  Bit Score: 119.63  E-value: 2.59e-32
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1034628471 360 DNSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYEDPPTAKAAVEWF 431
Cdd:cd12535     1 DNNTIFVQGLGEDVTIDSVADYFKQIGIIKTNKKTGKPMINLYTDKETGKLKGEATVSFDDPPSAKAAIDWF 72
RRM_SARFH cd12534
RNA recognition motif (RRM) found in Drosophila melanogaster RNA-binding protein cabeza and ...
364-431 5.09e-29

RNA recognition motif (RRM) found in Drosophila melanogaster RNA-binding protein cabeza and similar proteins; This subgroup corresponds to the RRM in cabeza, also termed P19, or sarcoma-associated RNA-binding fly homolog (SARFH). It is a putative homolog of human RNA-binding proteins FUS (also termed TLS or Pigpen or hnRNP P2), EWS (also termed EWSR1), TAF15 (also termed hTAFII68 or TAF2N or RPB56), and belongs to the of the FET (previously TET) (FUS/TLS, EWS, TAF15) family of RNA- and DNA-binding proteins whose expression is altered in cancer. It is a nuclear RNA binding protein that may play an important role in the regulation of RNA metabolism during fly development. Cabeza contains one RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain).


Pssm-ID: 240978 [Multi-domain]  Cd Length: 83  Bit Score: 110.20  E-value: 5.09e-29
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1034628471 364 IYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYEDPPTAKAAVEWF 431
Cdd:cd12534     1 VFVSNLPPNTTEQDLAEHFGSIGIIKIDKKTGKPKIWLYKDKDTGEPKGEATVTYDDPHAASAAIEWF 68
RRM1_TatSF1_like cd12281
RNA recognition motif 1 (RRM1) found in HIV Tat-specific factor 1 (Tat-SF1) and similar ...
361-418 4.07e-16

RNA recognition motif 1 (RRM1) found in HIV Tat-specific factor 1 (Tat-SF1) and similar proteins; This subfamily corresponds to the RRM1 of Tat-SF1 and CUS2. Tat-SF1 is the cofactor for stimulation of transcriptional elongation by human immunodeficiency virus-type 1 (HIV-1) Tat. It is a substrate of an associated cellular kinase. Tat-SF1 contains two RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains), and a highly acidic carboxyl-terminal half. The family also includes CUS2, a yeast homolog of human Tat-SF1. CUS2 interacts with U2 RNA in splicing extracts and functions as a splicing factor that aids assembly of the splicing-competent U2 snRNP in vivo. CUS2 also associates with PRP11 that is a subunit of the conserved splicing factor SF3a. Like Tat-SF1, CUS2 contains two RRMs as well.


Pssm-ID: 409723 [Multi-domain]  Cd Length: 92  Bit Score: 73.74  E-value: 4.07e-16
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 1034628471 361 NSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKEtGKPKGDATVSY 418
Cdd:cd12281     1 NTNVYVSGLPLDITVEEFVELFSKCGIIMEDPETGEPKIKLYRDEN-GNLKGDALCCY 57
RRM smart00360
RNA recognition motif;
364-429 1.26e-11

RNA recognition motif;


Pssm-ID: 214636 [Multi-domain]  Cd Length: 73  Bit Score: 60.30  E-value: 1.26e-11
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1034628471  364 IYVQGLNDSVTLDDLADFFKQCGVVKMnkrtgqpmIHIYLDKETGKPKGDATVSYEDPPTAKAAVE 429
Cdd:smart00360   2 LFVGNLPPDTTEEELRELFSKFGKVES--------VRLVRDKETGKSKGFAFVEFESEEDAEKALE 59
zf-RanBP pfam00641
Zn-finger in Ran binding protein and others;
478-520 3.49e-08

Zn-finger in Ran binding protein and others;


Pssm-ID: 395516 [Multi-domain]  Cd Length: 30  Bit Score: 49.27  E-value: 3.49e-08
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|...
gi 1034628471 478 RAGDWQCPnpsigdfccdvivcrGCGNQNFAWRTECNQCKAPK 520
Cdd:pfam00641   1 REGDWDCS---------------KCLVQNFATSTKCVACQAPK 28
RRM_SF cd00590
RNA recognition motif (RRM) superfamily; RRM, also known as RBD (RNA binding domain) or RNP ...
364-429 5.02e-08

RNA recognition motif (RRM) superfamily; RRM, also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).


Pssm-ID: 409669 [Multi-domain]  Cd Length: 72  Bit Score: 50.36  E-value: 5.02e-08
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1034628471 364 IYVQGLNDSVTLDDLADFFKQCGVVKmnkrtgqpMIHIYLDKEtGKPKGDATVSYEDPPTAKAAVE 429
Cdd:cd00590     1 LFVGNLPPDTTEEDLRELFSKFGEVV--------SVRIVRDRD-GKSKGFAFVEFESPEDAEKALE 57
RRM_1 pfam00076
RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain); The RRM motif is probably diagnostic ...
364-429 5.50e-08

RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain); The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternative splicing, and protein components of snRNPs. The motif also appears in a few single stranded DNA binding proteins. The RRM structure consists of four strands and two helices arranged in an alpha/beta sandwich, with a third helix present during RNA binding in some cases The C-terminal beta strand (4th strand) and final helix are hard to align and have been omitted in the SEED alignment The LA proteins have an N terminal rrm which is included in the seed. There is a second region towards the C terminus that has some features characteriztic of a rrm but does not appear to have the important structural core of a rrm. The LA proteins are one of the main autoantigens in Systemic lupus erythematosus (SLE), an autoimmune disease.


Pssm-ID: 425453 [Multi-domain]  Cd Length: 70  Bit Score: 49.92  E-value: 5.50e-08
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1034628471 364 IYVQGLNDSVTLDDLADFFKQCGVVKmnkrtgqpmiHIYL-DKETGKPKGDATVSYEDPPTAKAAVE 429
Cdd:pfam00076   1 LFVGNLPPDTTEEDLKDLFSKFGPIK----------SIRLvRDETGRSKGFAFVEFEDEEDAEKAIE 57
RRM COG0724
RNA recognition motif (RRM) domain [Translation, ribosomal structure and biogenesis];
364-429 1.05e-07

RNA recognition motif (RRM) domain [Translation, ribosomal structure and biogenesis];


Pssm-ID: 440488 [Multi-domain]  Cd Length: 85  Bit Score: 49.71  E-value: 1.05e-07
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1034628471 364 IYVQGLNDSVTLDDLADFFKQCGVVKMnkrtgqpmIHIYLDKETGKPKGDATVSYEDPPTAKAAVE 429
Cdd:COG0724     4 IYVGNLPYSVTEEDLRELFSEYGEVTS--------VKLITDRETGRSRGFGFVEMPDDEEAQAAIE 61
RRM_II_PABPs cd12306
RNA recognition motif in type II polyadenylate-binding proteins; This subfamily corresponds to ...
364-429 2.96e-07

RNA recognition motif in type II polyadenylate-binding proteins; This subfamily corresponds to the RRM of type II polyadenylate-binding proteins (PABPs), including polyadenylate-binding protein 2 (PABP-2 or PABPN1), embryonic polyadenylate-binding protein 2 (ePABP-2 or PABPN1L) and similar proteins. PABPs are highly conserved proteins that bind to the poly(A) tail present at the 3' ends of most eukaryotic mRNAs. They have been implicated in the regulation of poly(A) tail length during the polyadenylation reaction, translation initiation, mRNA stabilization by influencing the rate of deadenylation and inhibition of mRNA decapping. ePABP-2 is predominantly located in the cytoplasm and PABP-2 is located in the nucleus. In contrast to the type I PABPs containing four copies of RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains), the type II PABPs contains a single highly-conserved RRM. This subfamily also includes Saccharomyces cerevisiae RBP29 (SGN1, YIR001C) gene encoding cytoplasmic mRNA-binding protein Rbp29 that binds preferentially to poly(A). Although not essential for cell viability, Rbp29 plays a role in modulating the expression of cytoplasmic mRNA. Like other type II PABPs, Rbp29 contains one RRM only.


Pssm-ID: 409747 [Multi-domain]  Cd Length: 73  Bit Score: 48.07  E-value: 2.96e-07
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1034628471 364 IYVQGLNDSVTLDDLADFFKQCGVVkmNKRTgqpmihIYLDKETGKPKGDATVSYEDPptakAAVE 429
Cdd:cd12306     2 IYVGNVDYGTTPEELQAHFKSCGTI--NRVT------ILCDKFTGQPKGFAYIEFVDK----SSVE 55
RRM_CSTF2_RNA15_like cd12398
RNA recognition motif (RRM) found in cleavage stimulation factor subunit 2 (CSTF2), yeast ...
372-428 4.28e-07

RNA recognition motif (RRM) found in cleavage stimulation factor subunit 2 (CSTF2), yeast ortholog mRNA 3'-end-processing protein RNA15 and similar proteins; This subfamily corresponds to the RRM domain of CSTF2, its tau variant and eukaryotic homologs. CSTF2, also termed cleavage stimulation factor 64 kDa subunit (CstF64), is the vertebrate conterpart of yeast mRNA 3'-end-processing protein RNA15. It is expressed in all somatic tissues and is one of three cleavage stimulatory factor (CstF) subunits required for polyadenylation. CstF64 contains an N-terminal RNA recognition motif (RRM), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), a CstF77-binding domain, a repeated MEARA helical region and a conserved C-terminal domain reported to bind the transcription factor PC-4. During polyadenylation, CstF interacts with the pre-mRNA through the RRM of CstF64 at U- or GU-rich sequences within 10 to 30 nucleotides downstream of the cleavage site. CSTF2T, also termed tauCstF64, is a paralog of the X-linked cleavage stimulation factor CstF64 protein that supports polyadenylation in most somatic cells. It is expressed during meiosis and subsequent haploid differentiation in a more limited set of tissues and cell types, largely in meiotic and postmeiotic male germ cells, and to a lesser extent in brain. The loss of CSTF2T will cause male infertility, as it is necessary for spermatogenesis and fertilization. Moreover, CSTF2T is required for expression of genes involved in morphological differentiation of spermatids, as well as for genes having products that function during interaction of motile spermatozoa with eggs. It promotes germ cell-specific patterns of polyadenylation by using its RRM to bind to different sequence elements downstream of polyadenylation sites than does CstF64. The family also includes yeast ortholog mRNA 3'-end-processing protein RNA15 and similar proteins. RNA15 is a core subunit of cleavage factor IA (CFIA), an essential transcriptional 3'-end processing factor from Saccharomyces cerevisiae. RNA recognition by CFIA is mediated by an N-terminal RRM, which is contained in the RNA15 subunit of the complex. The RRM of RNA15 has a strong preference for GU-rich RNAs, mediated by a binding pocket that is entirely conserved in both yeast and vertebrate RNA15 orthologs.


Pssm-ID: 409832 [Multi-domain]  Cd Length: 77  Bit Score: 47.51  E-value: 4.28e-07
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 1034628471 372 SVTLDDLADFFKQCGVVKMnkrtgqpmIHIYLDKETGKPKGDATVSYEDPPTAKAAV 428
Cdd:cd12398    11 DATEEQLKEIFSEVGPVVS--------FRLVTDRETGKPKGYGFCEFRDAETALSAV 59
RRM_ist3_like cd12411
RNA recognition motif (RRM) found in ist3 family; This subfamily corresponds to the RRM of the ...
356-421 7.91e-07

RNA recognition motif (RRM) found in ist3 family; This subfamily corresponds to the RRM of the ist3 family that includes fungal U2 small nuclear ribonucleoprotein (snRNP) component increased sodium tolerance protein 3 (ist3), X-linked 2 RNA-binding motif proteins (RBMX2) found in Metazoa and plants, and similar proteins. Gene IST3 encoding ist3, also termed U2 snRNP protein SNU17 (Snu17p), is a novel yeast Saccharomyces cerevisiae protein required for the first catalytic step of splicing and for progression of spliceosome assembly. It binds specifically to the U2 snRNP and is an intrinsic component of prespliceosomes and spliceosomes. Yeast ist3 contains an atypical RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain). In the yeast pre-mRNA retention and splicing complex, the atypical RRM of ist3 functions as a scaffold that organizes the other two constituents, Bud13p (bud site selection 13) and Pml1p (pre-mRNA leakage 1). Fission yeast Schizosaccharomyces pombe gene cwf29 encoding ist3, also termed cell cycle control protein cwf29, is an RNA-binding protein complexed with cdc5 protein 29. It also contains one RRM. The biological function of RBMX2 remains unclear. It shows high sequence similarity to yeast ist3 protein and harbors one RRM as well.


Pssm-ID: 409845 [Multi-domain]  Cd Length: 89  Bit Score: 47.20  E-value: 7.91e-07
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1034628471 356 DEDSDNSAIYVQGLNDSVTLDDLADFFKQCGVVkmnkrtgqpmIHIYL--DKETGKPKGDATVSYEDP 421
Cdd:cd12411     4 DEYKDSAYIYIGGLPYELTEGDILCVFSQYGEI----------VDINLvrDKKTGKSKGFAFLAYEDQ 61
RRM3_RBM19_RRM2_MRD1 cd12316
RNA recognition motif 3 (RRM3) found in RNA-binding protein 19 (RBM19) and RNA recognition ...
364-429 1.62e-06

RNA recognition motif 3 (RRM3) found in RNA-binding protein 19 (RBM19) and RNA recognition motif 2 found in multiple RNA-binding domain-containing protein 1 (MRD1); This subfamily corresponds to the RRM3 of RBM19 and RRM2 of MRD1. RBM19, also termed RNA-binding domain-1 (RBD-1), is a nucleolar protein conserved in eukaryotes involved in ribosome biogenesis by processing rRNA and is essential for preimplantation development. It has a unique domain organization containing 6 conserved RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains). MRD1 is encoded by a novel yeast gene MRD1 (multiple RNA-binding domain). It is well conserved in yeast and its homologs exist in all eukaryotes. MRD1 is present in the nucleolus and the nucleoplasm. It interacts with the 35 S precursor rRNA (pre-rRNA) and U3 small nucleolar RNAs (snoRNAs). It is essential for the initial processing at the A0-A2 cleavage sites in the 35 S pre-rRNA. MRD1 contains 5 conserved RRMs, which may play an important structural role in organizing specific rRNA processing events.


Pssm-ID: 409755 [Multi-domain]  Cd Length: 74  Bit Score: 46.18  E-value: 1.62e-06
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1034628471 364 IYVQGLNDSVTLDDLADFFKQCGVVKMnkrtgqpmIHIYLDKETGKPKGDATVSYEDPPTAKAAVE 429
Cdd:cd12316     2 LFVRNLPFTATEDELRELFEAFGKISE--------VHIPLDKQTKRSKGFAFVLFVIPEDAVKAYQ 59
RRM2_hnRNPA_like cd12328
RNA recognition motif 2 (RRM2) found in heterogeneous nuclear ribonucleoprotein A subfamily; ...
364-428 6.14e-06

RNA recognition motif 2 (RRM2) found in heterogeneous nuclear ribonucleoprotein A subfamily; This subfamily corresponds to the RRM2 of hnRNP A0, hnRNP A1, hnRNP A2/B1, hnRNP A3 and similar proteins. hnRNP A0 is a low abundance hnRNP protein that has been implicated in mRNA stability in mammalian cells. It has been identified as the substrate for MAPKAP-K2 and may be involved in the lipopolysaccharide (LPS)-induced post-transcriptional regulation of tumor necrosis factor-alpha (TNF-alpha), cyclooxygenase 2 (COX-2) and macrophage inflammatory protein 2 (MIP-2). hnRNP A1 is an abundant eukaryotic nuclear RNA-binding protein that may modulate splice site selection in pre-mRNA splicing. hnRNP A2/B1 is an RNA trafficking response element-binding protein that interacts with the hnRNP A2 response element (A2RE). Many mRNAs, such as myelin basic protein (MBP), myelin-associated oligodendrocytic basic protein (MOBP), carboxyanhydrase II (CAII), microtubule-associated protein tau, and amyloid precursor protein (APP) are trafficked by hnRNP A2/B1. hnRNP A3 is also a RNA trafficking response element-binding protein that participates in the trafficking of A2RE-containing RNA. The hnRNP A subfamily is characterized by two RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains), followed by a long glycine-rich region at the C-terminus.


Pssm-ID: 409766 [Multi-domain]  Cd Length: 73  Bit Score: 44.18  E-value: 6.14e-06
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1034628471 364 IYVQGLNDSVTLDDLADFFKQCGVVKmnkrtgqpMIHIYLDKETGKPKGDATVSYEDPPTAKAAV 428
Cdd:cd12328     2 LFVGGLKEDVEEEDLREYFSQFGKVE--------SVEIVTDKETGKKRGFAFVTFDDHDSVDKIV 58
RRM2_SART3 cd12392
RNA recognition motif 2 (RRM2) found in squamous cell carcinoma antigen recognized by T-cells ...
364-428 9.28e-06

RNA recognition motif 2 (RRM2) found in squamous cell carcinoma antigen recognized by T-cells 3 (SART3) and similar proteins; This subfamily corresponds to the RRM2 of SART3, also termed Tat-interacting protein of 110 kDa (Tip110), is an RNA-binding protein expressed in the nucleus of the majority of proliferating cells, including normal cells and malignant cells, but not in normal tissues except for the testes and fetal liver. It is involved in the regulation of mRNA splicing probably via its complex formation with RNA-binding protein with a serine-rich domain (RNPS1), a pre-mRNA-splicing factor. SART3 has also been identified as a nuclear Tat-interacting protein that regulates Tat transactivation activity through direct interaction and functions as an important cellular factor for HIV-1 gene expression and viral replication. In addition, SART3 is required for U6 snRNP targeting to Cajal bodies. It binds specifically and directly to the U6 snRNA, interacts transiently with the U6 and U4/U6 snRNPs, and promotes the reassembly of U4/U6 snRNPs after splicing in vitro. SART3 contains an N-terminal half-a-tetratricopeptide repeat (HAT)-rich domain, a nuclearlocalization signal (NLS) domain, and two C-terminal RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains).


Pssm-ID: 409826 [Multi-domain]  Cd Length: 81  Bit Score: 44.24  E-value: 9.28e-06
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1034628471 364 IYVQGLNDSVTLDDLADFFKQCGVVKmnkrtgQPMIHIYLdkeTGKPKGDATVSYEDPPTAKAAV 428
Cdd:cd12392     5 LFVKGLPFSCTKEELEELFKQHGTVK------DVRLVTYR---NGKPKGLAYVEYENEADASQAV 60
RRM1_NUCLs cd12450
RNA recognition motif 1 (RRM1) found in nucleolin-like proteins mainly from plants; This ...
364-429 1.06e-05

RNA recognition motif 1 (RRM1) found in nucleolin-like proteins mainly from plants; This subfamily corresponds to the RRM1 of a group of plant nucleolin-like proteins, including nucleolin 1 (also termed protein nucleolin like 1) and nucleolin 2 (also termed protein nucleolin like 2, or protein parallel like 1). They play roles in the regulation of ribosome synthesis and in the growth and development of plants. Like yeast nucleolin, nucleolin-like proteins possess two RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains).


Pssm-ID: 409884 [Multi-domain]  Cd Length: 78  Bit Score: 43.93  E-value: 1.06e-05
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1034628471 364 IYVQGLNDSVTLDDLADFFKQCGVVKmnkrtgqpMIHIYLDKETGKPKGDATVSYEDPPTAKAAVE 429
Cdd:cd12450     2 LFVGNLSWSATQDDLENFFSDCGEVV--------DVRIAMDRDDGRSKGFGHVEFASAESAQKALE 59
ZnF_RBZ smart00547
Zinc finger domain; Zinc finger domain in Ran-binding proteins (RanBPs), and other proteins. ...
480-519 2.08e-05

Zinc finger domain; Zinc finger domain in Ran-binding proteins (RanBPs), and other proteins. In RanBPs, this domain binds RanGDP.


Pssm-ID: 197784 [Multi-domain]  Cd Length: 25  Bit Score: 41.53  E-value: 2.08e-05
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|
gi 1034628471  480 GDWQCPnpsigdfccdvivcrGCGNQNFAWRTECNQCKAP 519
Cdd:smart00547   1 GDWECP---------------ACTFLNFASRSKCFACGAP 25
RRM3_I_PABPs cd12380
RNA recognition motif 3 (RRM3) found found in type I polyadenylate-binding proteins; This ...
364-429 3.14e-05

RNA recognition motif 3 (RRM3) found found in type I polyadenylate-binding proteins; This subfamily corresponds to the RRM3 of type I poly(A)-binding proteins (PABPs), highly conserved proteins that bind to the poly(A) tail present at the 3' ends of most eukaryotic mRNAs. They have been implicated in the regulation of poly(A) tail length during the polyadenylation reaction, translation initiation, mRNA stabilization by influencing the rate of deadenylation and inhibition of mRNA decapping. The family represents type I polyadenylate-binding proteins (PABPs), including polyadenylate-binding protein 1 (PABP-1 or PABPC1), polyadenylate-binding protein 3 (PABP-3 or PABPC3), polyadenylate-binding protein 4 (PABP-4 or APP-1 or iPABP), polyadenylate-binding protein 5 (PABP-5 or PABPC5), polyadenylate-binding protein 1-like (PABP-1-like or PABPC1L), polyadenylate-binding protein 1-like 2 (PABPC1L2 or RBM32), polyadenylate-binding protein 4-like (PABP-4-like or PABPC4L), yeast polyadenylate-binding protein, cytoplasmic and nuclear (PABP or ACBP-67), and similar proteins. PABP-1 is an ubiquitously expressed multifunctional protein that may play a role in 3' end formation of mRNA, translation initiation, mRNA stabilization, protection of poly(A) from nuclease activity, mRNA deadenylation, inhibition of mRNA decapping, and mRNP maturation. Although PABP-1 is thought to be a cytoplasmic protein, it is also found in the nucleus. PABP-1 may be involved in nucleocytoplasmic trafficking and utilization of mRNP particles. PABP-1 contains four copies of RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains), a less well conserved linker region, and a proline-rich C-terminal conserved domain (CTD). PABP-3 is a testis-specific poly(A)-binding protein specifically expressed in round spermatids. It is mainly found in mammalian and may play an important role in the testis-specific regulation of mRNA homeostasis. PABP-3 shows significant sequence similarity to PABP-1. However, it binds to poly(A) with a lower affinity than PABP-1. PABP-1 possesses an A-rich sequence in its 5'-UTR and allows binding of PABP and blockage of translation of its own mRNA. In contrast, PABP-3 lacks the A-rich sequence in its 5'-UTR. PABP-4 is an inducible poly(A)-binding protein (iPABP) that is primarily localized to the cytoplasm. It shows significant sequence similarity to PABP-1 as well. The RNA binding properties of PABP-1 and PABP-4 appear to be identical. PABP-5 is encoded by PABPC5 gene within the X-specific subinterval, and expressed in fetal brain and in a range of adult tissues in mammalian, such as ovary and testis. It may play an important role in germ cell development. Moreover, unlike other PABPs, PABP-5 contains only four RRMs, but lacks both the linker region and the CTD. PABP-1-like and PABP-1-like 2 are the orthologs of PABP-1. PABP-4-like is the ortholog of PABP-5. Their cellular functions remain unclear. The family also includes the yeast PABP, a conserved poly(A) binding protein containing poly(A) tails that can be attached to the 3'-ends of mRNAs. The yeast PABP and its homologs may play important roles in the initiation of translation and in mRNA decay. Like vertebrate PABP-1, the yeast PABP contains four RRMs, a linker region, and a proline-rich CTD as well. The first two RRMs are mainly responsible for specific binding to poly(A). The proline-rich region may be involved in protein-protein interactions.


Pssm-ID: 409814 [Multi-domain]  Cd Length: 80  Bit Score: 42.55  E-value: 3.14e-05
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1034628471 364 IYVQGLNDSVTLDDLADFFKQCGvvkmnkrtgqPMIHIYLDK-ETGKPKGDATVSYEDPPTAKAAVE 429
Cdd:cd12380     4 VYVKNFGEDVDDDELKELFEKYG----------KITSAKVMKdDSGKSKGFGFVNFENHEAAQKAVE 60
RRM1_GRSF1 cd12730
RNA recognition motif 1 (RRM1) found in G-rich sequence factor 1 (GRSF-1) and similar proteins; ...
364-429 5.39e-05

RNA recognition motif 1 (RRM1) found in G-rich sequence factor 1 (GRSF-1) and similar proteins; This subgroup corresponds to the RRM1 of GRSF-1, a cytoplasmic poly(A)+ mRNA binding protein which interacts with RNA in a G-rich element-dependent manner. It may function in RNA packaging, stabilization of RNA secondary structure, or other macromolecular interactions. GRSF-1 contains three potential RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains), which are responsible for the RNA binding. In addition, GRSF-1 has two auxiliary domains, an acidic alpha-helical domain and an N-terminal alanine-rich region, that may play a role in protein-protein interactions and provide binding specificity.


Pssm-ID: 410129 [Multi-domain]  Cd Length: 79  Bit Score: 41.71  E-value: 5.39e-05
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1034628471 364 IYVQGLNDSVTLDDLADFFKQCGVvkmnkRTGQPMIHIYLDKEtGKPKGDATVSYEDPPTAKAAVE 429
Cdd:cd12730     4 VRARGLPWSCTAEDVLSFFSDCRI-----RNGEDGIHFLLNRD-GKRRGDALIELESEEDVQKALE 63
RRM_eIF3G_like cd12408
RNA recognition motif (RRM) found in eukaryotic translation initiation factor 3 subunit G ...
364-429 5.90e-05

RNA recognition motif (RRM) found in eukaryotic translation initiation factor 3 subunit G (eIF-3G) and similar proteins; This subfamily corresponds to the RRM of eIF-3G and similar proteins. eIF-3G, also termed eIF-3 subunit 4, or eIF-3-delta, or eIF3-p42, or eIF3-p44, is the RNA-binding subunit of eIF3, a large multisubunit complex that plays a central role in the initiation of translation by binding to the 40 S ribosomal subunit and promoting the binding of methionyl-tRNAi and mRNA. eIF-3G binds 18 S rRNA and beta-globin mRNA, and therefore appears to be a nonspecific RNA-binding protein. eIF-3G is one of the cytosolic targets and interacts with mature apoptosis-inducing factor (AIF). eIF-3G contains one RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain). This family also includes yeast eIF3-p33, a homolog of vertebrate eIF-3G, plays an important role in the initiation phase of protein synthesis in yeast. It binds both, mRNA and rRNA, fragments due to an RRM near its C-terminus.


Pssm-ID: 409842 [Multi-domain]  Cd Length: 76  Bit Score: 41.73  E-value: 5.90e-05
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1034628471 364 IYVQGLNDSVTLDDLADFFKQCGvvkmnkrtgqPMIHIYL--DKETGKPKGDATVSYEDPPTAKAAVE 429
Cdd:cd12408     2 IRVTNLSEDATEEDLRELFRPFG----------PISRVYLakDKETGQSKGFAFVTFETREDAERAIE 59
RRM_HP0827_like cd12399
RNA recognition motif (RRM) found in Helicobacter pylori HP0827 protein and similar proteins; ...
364-428 6.81e-05

RNA recognition motif (RRM) found in Helicobacter pylori HP0827 protein and similar proteins; This subfamily corresponds to the RRM of H. pylori HP0827, a putative ssDNA-binding protein 12rnp2 precursor, containing one RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain). The ssDNA binding may be important in activation of HP0827.


Pssm-ID: 409833 [Multi-domain]  Cd Length: 75  Bit Score: 41.35  E-value: 6.81e-05
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1034628471 364 IYVQGLNDSVTLDDLADFFKQCGVVKMnkrtgqpmIHIYLDKETGKPKGDATVSYEDPPTAKAAV 428
Cdd:cd12399     1 LYVGNLPYSASEEQLKSLFGQFGAVFD--------VKLPMDRETKRPRGFGFVELQEEESAEKAI 57
RRM1_MSSP cd12243
RNA recognition motif 1 (RRM1) found in the c-myc gene single-strand binding proteins (MSSP) ...
364-431 6.99e-05

RNA recognition motif 1 (RRM1) found in the c-myc gene single-strand binding proteins (MSSP) family; This subfamily corresponds to the RRM1 of c-myc gene single-strand binding proteins (MSSP) family, including single-stranded DNA-binding protein MSSP-1 (also termed RBMS1 or SCR2) and MSSP-2 (also termed RBMS2 or SCR3). All MSSP family members contain two RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains), both of which are responsible for the specific DNA binding activity. Both, MSSP-1 and -2, have been identified as protein factors binding to a putative DNA replication origin/transcriptional enhancer sequence present upstream from the human c-myc gene in both single- and double-stranded forms. Thus, they have been implied in regulating DNA replication, transcription, apoptosis induction, and cell-cycle movement, via the interaction with c-MYC, the product of protooncogene c-myc. Moreover, the family includes a new member termed RNA-binding motif, single-stranded-interacting protein 3 (RBMS3), which is not a transcriptional regulator. RBMS3 binds with high affinity to A/U-rich stretches of RNA, and to A/T-rich DNA sequences, and functions as a regulator of cytoplasmic activity. In addition, a putative meiosis-specific RNA-binding protein termed sporulation-specific protein 5 (SPO5, or meiotic RNA-binding protein 1, or meiotically up-regulated gene 12 protein), encoded by Schizosaccharomyces pombe Spo5/Mug12 gene, is also included in this family. SPO5 is a novel meiosis I regulator that may function in the vicinity of the Mei2 dot.


Pssm-ID: 409689 [Multi-domain]  Cd Length: 71  Bit Score: 41.14  E-value: 6.99e-05
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1034628471 364 IYVQGLNDSVTLDDLADFFKQCGVVKMNKrtgqpmihIYLDKETGKPKGDATVSYEDPPTAKAAVEWF 431
Cdd:cd12243     3 VYIRGLPPNTTDEDLLLLCQSFGKIISTK--------AIIDKQTNKCKGYGFVDFDSPEAALKAIEGL 62
PHA03247 PHA03247
large tegument protein UL36; Provisional
72-207 8.72e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 46.08  E-value: 8.72e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034628471   72 GQPPTGYTTPTAPQA--YSQPVQGYGTGAYDTTTATVTTTQASYAAQSAYGTQPAYPAYGQQPAATAPTRPQDGNKPTET 149
Cdd:PHA03247  2869 RSPAAKPAAPARPPVrrLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTD 2948
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1034628471  150 SQPQSS-TGGYNQPSLGY---GQSNYSYPQVPGSYPMQPVTAPPSYPPTSYSSTQPTSYDQS 207
Cdd:PHA03247  2949 PAGAGEpSGAVPQPWLGAlvpGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSRVSSWASS 3010
RRM_RBM8 cd12324
RNA recognition motif (RRM) found in RNA-binding protein RBM8A, RBM8B nd similar proteins; ...
364-429 1.20e-04

RNA recognition motif (RRM) found in RNA-binding protein RBM8A, RBM8B nd similar proteins; This subfamily corresponds to the RRM of RBM8, also termed binder of OVCA1-1 (BOV-1), or RNA-binding protein Y14, which is one of the components of the exon-exon junction complex (EJC). It has two isoforms, RBM8A and RBM8B, both of which are identical except that RBM8B is 16 amino acids shorter at its N-terminus. RBM8, together with other EJC components (such as Magoh, Aly/REF, RNPS1, Srm160, and Upf3), plays critical roles in postsplicing processing, including nuclear export and cytoplasmic localization of the mRNA, and the nonsense-mediated mRNA decay (NMD) surveillance process. RBM8 binds to mRNA 20-24 nucleotides upstream of a spliced exon-exon junction. It is also involved in spliced mRNA nuclear export, and the process of nonsense-mediated decay of mRNAs with premature stop codons. RBM8 forms a specific heterodimer complex with the EJC protein Magoh which then associates with Aly/REF, RNPS1, DEK, and SRm160 on the spliced mRNA, and inhibits ATP turnover by eIF4AIII, thereby trapping the EJC core onto RNA. RBM8 contains an N-terminal putative bipartite nuclear localization signal, one RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain), in the central region, and a C-terminal serine-arginine rich region (SR domain) and glycine-arginine rich region (RG domain).


Pssm-ID: 409762 [Multi-domain]  Cd Length: 88  Bit Score: 41.06  E-value: 1.20e-04
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1034628471 364 IYVQGLNDSVTLDDLADFFKQCGVVKMnkrtgqpmIHIYLDKETGKPKGDATVSYEDPPTAKAAVE 429
Cdd:cd12324     9 IFVTGVHEEAQEEDIHDKFAEFGEIKN--------LHLNLDRRTGFVKGYALVEYETKKEAQAAIE 66
RRM3_HRB1_GBP2 cd21607
RNA recognition motif 3 (RRM3) found in Saccharomyces cerevisiae protein HRB1, ...
361-429 1.24e-04

RNA recognition motif 3 (RRM3) found in Saccharomyces cerevisiae protein HRB1, G-strand-binding protein 2 (GBP2) and similar proteins; The family includes Saccharomyces cerevisiae protein HRB1 (also called protein TOM34) and GBP2, both of which are SR-like mRNA-binding proteins which shuttle from the nucleus to the cytoplasm when bound to the mature mRNA molecules. They act as quality control factors for spliced mRNAs. GBP2, also called RAP1 localization factor 6, is a single-strand telomeric DNA-binding protein that binds single-stranded telomeric sequences of the type (TG[1-3])n in vitro. It also binds to RNA. GBP2 influences the localization of RAP1 in the nuclei and plays a role in modulating telomere length. Members in this family contain three RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains). The model corresponds to the third RRM motif.


Pssm-ID: 410186 [Multi-domain]  Cd Length: 79  Bit Score: 40.77  E-value: 1.24e-04
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1034628471 361 NSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKrtgqpmihIYLDkETGKPKGDATVSYEDPPTAKAAVE 429
Cdd:cd21607     2 NNTIYCSNLPLSTAESDLYDLFETIGKVNNAE--------LKYD-ETGDPTGSAVVEYENLDDADVCIS 61
RRM2_hnRNPA0 cd12579
RNA recognition motif 2 (RRM2) found in heterogeneous nuclear ribonucleoprotein A0 (hnRNP A0) ...
364-431 1.75e-04

RNA recognition motif 2 (RRM2) found in heterogeneous nuclear ribonucleoprotein A0 (hnRNP A0) and similar proteins; This subgroup corresponds to the RRM2 of hnRNP A0, a low abundance hnRNP protein that has been implicated in mRNA stability in mammalian cells. It has been identified as the substrate for MAPKAP-K2 and may be involved in the lipopolysaccharide (LPS)-induced post-transcriptional regulation of tumor necrosis factor-alpha (TNF-alpha), cyclooxygenase 2 (COX-2) and macrophage inflammatory protein 2 (MIP-2). hnRNP A0 contains two RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains), followed by a long glycine-rich region at the C-terminus.


Pssm-ID: 409993 [Multi-domain]  Cd Length: 80  Bit Score: 40.59  E-value: 1.75e-04
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1034628471 364 IYVQGLNDSVTLDDLADFFKQCGVVKmnkrtgqpMIHIYLDKETGKPKGDATVSYEDPPTA-KAAVEWF 431
Cdd:cd12579     2 LFVGGLKGDVGEGDLVEHFSQFGTVE--------KVEVIADKDTGKKRGFGFVYFEDHDSAdKAAVVKF 62
RRM1_Nop4p cd12674
RNA recognition motif 1 (RRM1) found in yeast nucleolar protein 4 (Nop4p) and similar proteins; ...
362-428 2.05e-04

RNA recognition motif 1 (RRM1) found in yeast nucleolar protein 4 (Nop4p) and similar proteins; This subgroup corresponds to the RRM1 of Nop4p (also known as Nop77p), encoded by YPL043W from Saccharomyces cerevisiae. It is an essential nucleolar protein involved in processing and maturation of 27S pre-rRNA and biogenesis of 60S ribosomal subunits. Nop4p has four RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains).


Pssm-ID: 410075 [Multi-domain]  Cd Length: 80  Bit Score: 40.14  E-value: 2.05e-04
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1034628471 362 SAIYVQGLNDSVTLDDLADFFKQCGVVKmnkrtgQPMihIYLDKETGKPKGDATVSYEDPPTAKAAV 428
Cdd:cd12674     1 TTLFVRNLPFDVTLESLTDFFSDIGPVK------HAV--VVTDPETKKSRGYGFVSFSTHDDAEEAL 59
RRM2_NsCP33_like cd21608
RNA recognition motif 2 (RRM2) found in Nicotiana sylvestris chloroplastic 33 kDa ...
364-429 2.33e-04

RNA recognition motif 2 (RRM2) found in Nicotiana sylvestris chloroplastic 33 kDa ribonucleoprotein (NsCP33) and similar proteins; The family includes NsCP33, Arabidopsis thaliana chloroplastic 31 kDa ribonucleoprotein (CP31A) and mitochondrial glycine-rich RNA-binding protein 2 (AtGR-RBP2). NsCP33 may be involved in splicing and/or processing of chloroplast RNA's. AtCP31A, also called RNA-binding protein 1/2/3 (AtRBP33), or RNA-binding protein CP31A, or RNA-binding protein RNP-T, or RNA-binding protein cp31, is required for specific RNA editing events in chloroplasts and stabilizes specific chloroplast mRNAs, as well as for normal chloroplast development under cold stress conditions by stabilizing transcripts of numerous mRNAs under these conditions. CP31A may modulate telomere replication through RNA binding domains. AtGR-RBP2, also called AtRBG2, or glycine-rich protein 2 (AtGRP2), or mitochondrial RNA-binding protein 1a (At-mRBP1a), plays a role in RNA transcription or processing during stress. It binds RNAs and DNAs sequence with a preference to single-stranded nucleic acids. AtGR-RBP2 displays strong affinity to poly(U) sequence. It exerts cold and freezing tolerance, probably by exhibiting an RNA chaperone activity during the cold and freezing adaptation process. Some members in this family contain two RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains). The model corresponds to the second RRM motif.


Pssm-ID: 410187 [Multi-domain]  Cd Length: 76  Bit Score: 39.84  E-value: 2.33e-04
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1034628471 364 IYVQGLNDSVTLDDLADFFKQCGVVkmnkrTGQPMIHiylDKETGKPKGDATVSYEDPPTAKAAVE 429
Cdd:cd21608     2 LYVGNLSWDTTEDDLRDLFSEFGEV-----ESAKVIT---DRETGRSRGFGFVTFSTAEAAEAAID 59
RRM3_RBM28_like cd12415
RNA recognition motif 3 (RRM3) found in RNA-binding protein 28 (RBM28) and similar proteins; ...
364-429 3.19e-04

RNA recognition motif 3 (RRM3) found in RNA-binding protein 28 (RBM28) and similar proteins; This subfamily corresponds to the RRM3 of RBM28 and Nop4p. RBM28 is a specific nucleolar component of the spliceosomal small nuclear ribonucleoproteins (snRNPs), possibly coordinating their transition through the nucleolus. It specifically associates with U1, U2, U4, U5, and U6 small nuclear RNAs (snRNAs), and may play a role in the maturation of both small nuclear and ribosomal RNAs. RBM28 has four RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains), and an extremely acidic region between RRM2 and RRM3. The family also includes nucleolar protein 4 (Nop4p or Nop77p) encoded by YPL043W from Saccharomyces cerevisiae. It is an essential nucleolar protein involved in processing and maturation of 27S pre-rRNA and biogenesis of 60S ribosomal subunits. Nop4p also contains four RRMs.


Pssm-ID: 409849 [Multi-domain]  Cd Length: 83  Bit Score: 39.89  E-value: 3.19e-04
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1034628471 364 IYVQGLNDSVTLDDLADFFKQCGVVKMNKrtgqpmihIYLDKETGKPKGDATVSYEDPPTAKAAVE 429
Cdd:cd12415     3 VFIRNLSFDTTEEDLKEFFSKFGEVKYAR--------IVLDKDTGHSKGTAFVQFKTKESADKCIE 60
RRM_DAZL_BOULE cd12412
RNA recognition motif (RRM) found in AZoospermia (DAZ) autosomal homologs, DAZL (DAZ-like) and ...
364-430 4.20e-04

RNA recognition motif (RRM) found in AZoospermia (DAZ) autosomal homologs, DAZL (DAZ-like) and BOULE; This subfamily corresponds to the RRM domain of two Deleted in AZoospermia (DAZ) autosomal homologs, DAZL (DAZ-like) and BOULE. BOULE is the founder member of the family and DAZL arose from BOULE in an ancestor of vertebrates. The DAZ gene subsequently originated from a duplication transposition of the DAZL gene. Invertebrates contain a single DAZ homolog, BOULE, while vertebrates, other than catarrhine primates, possess both BOULE and DAZL genes. The catarrhine primates possess BOULE, DAZL, and DAZ genes. The family members encode closely related RNA-binding proteins that are required for fertility in numerous organisms. These proteins contain an RNA recognition motif (RRM), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), and a varying number of copies of a DAZ motif, believed to mediate protein-protein interactions. DAZL and BOULE contain a single copy of the DAZ motif, while DAZ proteins can contain 8-24 copies of this repeat. Although their specific biochemical functions remain to be investigated, DAZL proteins may interact with poly(A)-binding proteins (PABPs), and act as translational activators of specific mRNAs during gametogenesis.


Pssm-ID: 409846 [Multi-domain]  Cd Length: 81  Bit Score: 39.52  E-value: 4.20e-04
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1034628471 364 IYVQGLNDSVTLDDLADFFKQCGVVKMNKrtgqpmihIYLDKEtGKPKGDATVSYEDPPTAKAAVEW 430
Cdd:cd12412     5 IFVGGIDWDTTEEELREFFSKFGKVKDVK--------IIKDRA-GVSKGYGFVTFETQEDAEKIQKW 62
RRM2_SREK1 cd12260
RNA recognition motif 2 (RRM2) found in splicing regulatory glutamine/lysine-rich protein 1 ...
364-404 4.82e-04

RNA recognition motif 2 (RRM2) found in splicing regulatory glutamine/lysine-rich protein 1 (SREK1) and similar proteins; This subfamily corresponds to the RRM2 of SREK1, also termed serine/arginine-rich-splicing regulatory protein 86-kDa (SRrp86), or splicing factor arginine/serine-rich 12 (SFRS12), or splicing regulatory protein 508 amino acid (SRrp508). SREK1 belongs to a family of proteins containing regions rich in serine-arginine dipeptides (SR proteins family), which is involved in bridge-complex formation and splicing by mediating protein-protein interactions across either introns or exons. It is a unique SR family member and it may play a crucial role in determining tissue specific patterns of alternative splicing. SREK1 can alter splice site selection by both positively and negatively modulating the activity of other SR proteins. For instance, SREK1 can activate SRp20 and repress SC35 in a dose-dependent manner both in vitro and in vivo. In addition, SREK1 contains two (some contain only one) RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains), and two serine-arginine (SR)-rich domains (SR domains) separated by an unusual glutamic acid-lysine (EK) rich region. The RRM and SR domains are highly conserved among other members of the SR superfamily. However, the EK domain is unique to SREK1. It plays a modulatory role controlling SR domain function by involvement in the inhibition of both constitutive and alternative splicing and in the selection of splice-site.


Pssm-ID: 409705 [Multi-domain]  Cd Length: 85  Bit Score: 39.21  E-value: 4.82e-04
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....
gi 1034628471 364 IYVQGLNDSVTLDDLADFFKQCGVVK---MNKRTGQPMIHIYLD 404
Cdd:cd12260     7 VYVGNLDPSTTADQLLEFFSQAGEVKyvrMAGDETQPTRYAFVE 50
RRM2_MRD1 cd12566
RNA recognition motif 2 (RRM2) found in yeast multiple RNA-binding domain-containing protein 1 ...
360-427 4.95e-04

RNA recognition motif 2 (RRM2) found in yeast multiple RNA-binding domain-containing protein 1 (MRD1) and similar proteins; This subgroup corresponds to the RRM2 of MRD1 which is encoded by a novel yeast gene MRD1 (multiple RNA-binding domain). It is well-conserved in yeast and its homologs exist in all eukaryotes. MRD1 is present in the nucleolus and the nucleoplasm. It interacts with the 35 S precursor rRNA (pre-rRNA) and U3 small nucleolar RNAs (snoRNAs). It is essential for the initial processing at the A0-A2 cleavage sites in the 35 S pre-rRNA. MRD1 contains 5 conserved RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains), which may play an important structural role in organizing specific rRNA processing events.


Pssm-ID: 409982 [Multi-domain]  Cd Length: 79  Bit Score: 39.32  E-value: 4.95e-04
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1034628471 360 DNSAIYVQGLNDSVTLDDLADFFKQCGVVKMnkrtgqpmIHIYLDKETGKPKGDATVSYEDPPTAKAA 427
Cdd:cd12566     1 ETGRLFLRNLPYSTKEDDLQKLFSKFGEVSE--------VHVPIDKKTKKSKGFAYVLFLDPEDAVQA 60
RRM_PPIE cd12347
RNA recognition motif (RRM) found in cyclophilin-33 (Cyp33) and similar proteins; This ...
364-429 4.98e-04

RNA recognition motif (RRM) found in cyclophilin-33 (Cyp33) and similar proteins; This subfamily corresponds to the RRM of Cyp33, also termed peptidyl-prolyl cis-trans isomerase E (PPIase E), or cyclophilin E, or rotamase E. Cyp33 is a nuclear RNA-binding cyclophilin with an N-terminal RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain), and a C-terminal PPIase domain. Cyp33 possesses RNA-binding activity and preferentially binds to polyribonucleotide polyA and polyU, but hardly to polyG and polyC. It binds specifically to mRNA, which can stimulate its PPIase activity. Moreover, Cyp33 interacts with the third plant homeodomain (PHD3) zinc finger cassette of the mixed lineage leukemia (MLL) proto-oncoprotein and a poly-A RNA sequence through its RRM domain. It further mediates downregulation of the expression of MLL target genes HOXC8, HOXA9, CDKN1B, and C-MYC, in a proline isomerase-dependent manner. Cyp33 also possesses a PPIase activity that catalyzes cis-trans isomerization of the peptide bond preceding a proline, which has been implicated in the stimulation of folding and conformational changes in folded and unfolded proteins. The PPIase activity can be inhibited by the immunosuppressive drug cyclosporin A.


Pssm-ID: 409783 [Multi-domain]  Cd Length: 75  Bit Score: 39.13  E-value: 4.98e-04
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1034628471 364 IYVQGLNDSVTLDDLADFFKQCGVVKMnkrtgqpmIHIYLDKETGKPKGDATVSYEDPPTAKAAVE 429
Cdd:cd12347     1 LYVGGLAEEVDEKVLHAAFIPFGDIVD--------IQIPLDYETEKHRGFAFVEFEEAEDAAAAID 58
RRM_hnRNPH_ESRPs_RBM12_like cd12254
RNA recognition motif (RRM) found in heterogeneous nuclear ribonucleoprotein (hnRNP) H protein ...
364-429 6.65e-04

RNA recognition motif (RRM) found in heterogeneous nuclear ribonucleoprotein (hnRNP) H protein family, epithelial splicing regulatory proteins (ESRPs), Drosophila RNA-binding protein Fusilli, RNA-binding protein 12 (RBM12) and similar proteins; The family includes RRM domains in the hnRNP H protein family, G-rich sequence factor 1 (GRSF-1), ESRPs (also termed RBM35), Drosophila Fusilli, RBM12 (also termed SWAN), RBM12B, RBM19 (also termed RBD-1) and similar proteins. The hnRNP H protein family includes hnRNP H (also termed mcs94-1), hnRNP H2 (also termed FTP-3 or hnRNP H'), hnRNP F and hnRNP H3 (also termed hnRNP 2H9), which represent a group of nuclear RNA binding proteins that are involved in pre-mRNA processing. GRSF-1 is a cytoplasmic poly(A)+ mRNA binding protein which interacts with RNA in a G-rich element-dependent manner. It may function in RNA packaging, stabilization of RNA secondary structure, or other macromolecular interactions. ESRP1 (also termed RBM35A) and ESRP2 (also termed RBM35B) are epithelial-specific RNA binding proteins that promote splicing of the epithelial variant of fibroblast growth factor receptor 2 (FGFR2), ENAH (also termed hMena), CD44 and CTNND1 (also termed p120-Catenin) transcripts. Fusilli shows high sequence homology to ESRPs. It can regulate endogenous FGFR2 splicing and functions as a splicing factor. The biological roles of both, RBM12 and RBM12B, remain unclear. RBM19 is a nucleolar protein conserved in eukaryotes. It is involved in ribosome biogenesis by processing rRNA. In addition, it is essential for preimplantation development. Members in this family contain 2~6 conserved RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains).


Pssm-ID: 409699 [Multi-domain]  Cd Length: 73  Bit Score: 38.70  E-value: 6.65e-04
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1034628471 364 IYVQGLNDSVTLDDLADFFKQCGVVKMNkrtgqpmIHIyLDKETGKPKGDATVSYEDPPTAKAAVE 429
Cdd:cd12254     2 VRLRGLPFSATEEDIRDFFSGLDIPPDG-------IHI-VYDDDGRPTGEAYVEFASEEDAQRALR 59
RRM_Aly_REF_like cd12418
RNA recognition motif (RRM) found in the Aly/REF family; This subfamily corresponds to the RRM ...
364-431 7.54e-04

RNA recognition motif (RRM) found in the Aly/REF family; This subfamily corresponds to the RRM of Aly/REF family which includes THO complex subunit 4 (THOC4, also termed Aly/REF), S6K1 Aly/REF-like target (SKAR, also termed PDIP3 or PDIP46) and similar proteins. THOC4 is an mRNA transporter protein with a well conserved RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain). It is involved in RNA transportation from the nucleus, and was initially identified as a transcription coactivator of LEF-1 and AML-1 for the TCRalpha enhancer function. In addition, THOC4 specifically binds to rhesus (RH) promoter in erythroid, and might be a novel transcription cofactor for erythroid-specific genes. SKAR shows high sequence homology with THOC4 and possesses one RRM as well. SKAR is widely expressed and localizes to the nucleus. It may be a critical player in the function of S6K1 in cell and organism growth control by binding the activated, hyperphosphorylated form of S6K1 but not S6K2. Furthermore, SKAR functions as a protein partner of the p50 subunit of DNA polymerase delta. In addition, SKAR may have particular importance in pancreatic beta cell size determination and insulin secretion.


Pssm-ID: 409852 [Multi-domain]  Cd Length: 75  Bit Score: 38.33  E-value: 7.54e-04
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1034628471 364 IYVQGLNDSVTLDDLADFFKQCGVVKMnkrtgqpmIHIYLDKeTGKPKGDATVSYEDPPTAKAAVEWF 431
Cdd:cd12418     3 VRVSNLHPDVTEEDLRELFGRVGPVKS--------VKINYDR-SGRSTGTAYVVFERPEDAEKAIKQF 61
RRM2_PHIP1 cd12272
RNA recognition motif 2 (RRM2) found in Arabidopsis thaliana phragmoplastin interacting ...
364-429 1.03e-03

RNA recognition motif 2 (RRM2) found in Arabidopsis thaliana phragmoplastin interacting protein 1 (PHIP1) and similar proteins; The CD corresponds to the RRM2 of PHIP1. A. thaliana PHIP1 and its homologs represent a novel class of plant-specific RNA-binding proteins that may play a unique role in the polarized mRNA transport to the vicinity of the cell plate. The family members consist of multiple functional domains, including a lysine-rich domain (KRD domain) that contains three nuclear localization motifs (KKKR/NK), two RNA recognition motifs (RRMs), and three CCHC-type zinc fingers. PHIP1 is a peripheral membrane protein and is localized at the cell plate during cytokinesis in plants. In addition to phragmoplastin, PHIP1 interacts with two Arabidopsis small GTP-binding proteins, Rop1 and Ran2. However, PHIP1 interacted only with the GTP-bound form of Rop1 but not the GDP-bound form. It also binds specifically to Ran2 mRNA.


Pssm-ID: 409715 [Multi-domain]  Cd Length: 73  Bit Score: 38.15  E-value: 1.03e-03
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1034628471 364 IYVQGLNDSVTLDDLADFFKQCGVVKmNKRTGQpmihiylDKETGKPKGDATVSYEDPPTAKAAVE 429
Cdd:cd12272     2 VYIGNLAWDIDEDDLRELFAECCEIT-NVRLHT-------DKETGEFKGYGHVEFADEESLDAALK 59
RRM1_hnRNPH_GRSF1_like cd12503
RNA recognition motif 1 (RRM1) found in heterogeneous nuclear ribonucleoprotein (hnRNP) H ...
364-429 1.20e-03

RNA recognition motif 1 (RRM1) found in heterogeneous nuclear ribonucleoprotein (hnRNP) H protein family, G-rich sequence factor 1 (GRSF-1) and similar proteins; This subfamily corresponds to the RRM1 of hnRNP H proteins and GRSF-1. The hnRNP H protein family includes hnRNP H (also termed mcs94-1), hnRNP H2 (also termed FTP-3 or hnRNP H'), hnRNP F and hnRNP H3 (also termed hnRNP 2H9), which represent a group of nuclear RNA binding proteins that are involved in pre-mRNA processing. These proteins have similar RNA binding affinities and specifically recognize the sequence GGGA. They can either stimulate or repress splicing upon binding to a GGG motif. hnRNP H binds to the RNA substrate in the presence or absence of these proteins, whereas hnRNP F binds to the nuclear mRNA only in the presence of cap-binding proteins. hnRNP H and hnRNP H2 are almost identical; both have been found to bind nuclear-matrix proteins. hnRNP H activates exon inclusion by binding G-rich intronic elements downstream of the 5' splice site in the transcripts of c-src, human immunodeficiency virus type 1 (HIV-1), Bcl-X, GRIN1, and myelin. It silences exons when bound to exonic elements in the transcripts of beta-tropomyosin, HIV-1, and alpha-tropomyosin. hnRNP H2 has been implicated in pre-mRNA 3' end formation. hnRNP H3 may be involved in splicing arrest induced by heat shock. Most family members contain three RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains), except for hnRNP H3, in which the RRM1 is absent. RRM1 and RRM2 are responsible for the binding to the RNA at DGGGD motifs, and play an important role in efficiently silencing the exon. Members in this family can regulate the alternative splicing of fibroblast growth factor receptor 2 (FGFR2) transcripts, and function as silencers of FGFR2 exon IIIc through an interaction with the exonic GGG motifs. The lack of RRM1 could account for the reduced silencing activity within hnRNP H3. Members in this family have an extensive glycine-rich region near the C-terminus, which may allow them to homo- or heterodimerize. They also include a cytoplasmic poly(A)+ mRNA binding protein, GRSF-1, which interacts with RNA in a G-rich element-dependent manner. They may function in RNA packaging, stabilization of RNA secondary structure, or other macromolecular interactions. GRSF-1 contains three potential RRMs responsible for the RNA binding, and two auxiliary domains (an acidic alpha-helical domain and an N-terminal alanine-rich region) that may play a role in protein-protein interactions and provide binding specificity.


Pssm-ID: 409926 [Multi-domain]  Cd Length: 77  Bit Score: 38.14  E-value: 1.20e-03
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1034628471 364 IYVQGLNDSVTLDDLADFFKQCGVvkmnkRTGQPMIHIYLDKEtGKPKGDATVSYEDPPTAKAAVE 429
Cdd:cd12503     2 VRARGLPWSATAEDVLNFFTDCRI-----KGGENGIHFTYTRE-GRPSGEAFIELESEEDVEKALE 61
RRM2_NUCLs cd12451
RNA recognition motif 2 (RRM2) found in nucleolin-like proteins mainly from plants; This ...
364-429 1.58e-03

RNA recognition motif 2 (RRM2) found in nucleolin-like proteins mainly from plants; This subfamily corresponds to the RRM2 of a group of plant nucleolin-like proteins, including nucleolin 1 (also termed protein nucleolin like 1) and nucleolin 2 (also termed protein nucleolin like 2, or protein parallel like 1). They play roles in the regulation of ribosome synthesis and in the growth and development of plants. Like yeast nucleolin, nucleolin-like proteins possess two RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains).


Pssm-ID: 409885 [Multi-domain]  Cd Length: 79  Bit Score: 37.78  E-value: 1.58e-03
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034628471 364 IYVQGLNDSVTLDDLAD----FFKQCGVVKMnkrtgqpmIHIYLDKETGKPKGDATVSYEDPPTAKAAVE 429
Cdd:cd12451     2 IFVKGFDASLGEDTIRDelreHFGECGEVTN--------VRIPTDRETGELKGFAYIEFSTKEAKEKALE 63
RRM2_Hrp1p cd12330
RNA recognition motif 2 (RRM2) found in yeast nuclear polyadenylated RNA-binding protein 4 ...
364-426 2.03e-03

RNA recognition motif 2 (RRM2) found in yeast nuclear polyadenylated RNA-binding protein 4 (Hrp1p or Nab4p) and similar proteins; This subfamily corresponds to the RRM1 of Hrp1p and similar proteins. Hrp1p or Nab4p, also termed cleavage factor IB (CFIB), is a sequence-specific trans-acting factor that is essential for mRNA 3'-end formation in yeast Saccharomyces cerevisiae. It can be UV cross-linked to RNA and specifically recognizes the (UA)6 RNA element required for both, the cleavage and poly(A) addition steps. Moreover, Hrp1p can shuttle between the nucleus and the cytoplasm, and play an additional role in the export of mRNAs to the cytoplasm. Hrp1p also interacts with Rna15p and Rna14p, two components of CF1A. In addition, Hrp1p functions as a factor directly involved in modulating the activity of the nonsense-mediated mRNA decay (NMD) pathway; it binds specifically to a downstream sequence element (DSE)-containing RNA and interacts with Upf1p, a component of the surveillance complex, further triggering the NMD pathway. Hrp1p contains two central RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains), and an arginine-glycine-rich region harboring repeats of the sequence RGGF/Y.


Pssm-ID: 409767 [Multi-domain]  Cd Length: 78  Bit Score: 37.30  E-value: 2.03e-03
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1034628471 364 IYVQGLNDSVTLDDLADFFKQCGVVKmnkrTGQPMihiyLDKETGKPKGDATVSYEDPPTAKA 426
Cdd:cd12330     2 IFVGGLAPDVTEEEFKEYFEQFGTVV----DAVVM----LDHDTGRSRGFGFVTFDSESAVEK 56
RRM1_PHIP1 cd12271
RNA recognition motif 1 (RRM1) found in Arabidopsis thaliana phragmoplastin interacting ...
364-430 2.36e-03

RNA recognition motif 1 (RRM1) found in Arabidopsis thaliana phragmoplastin interacting protein 1 (PHIP1) and similar proteins; This subfamily corresponds to the RRM1 of PHIP1. A. thaliana PHIP1 and its homologs represent a novel class of plant-specific RNA-binding proteins that may play a unique role in the polarized mRNA transport to the vicinity of the cell plate. The family members consist of multiple functional domains, including a lysine-rich domain (KRD domain) that contains three nuclear localization motifs (KKKR/NK), two RNA recognition motifs (RRMs), and three CCHC-type zinc fingers. PHIP1 is a peripheral membrane protein and is localized at the cell plate during cytokinesis in plants. In addition to phragmoplastin, PHIP1 interacts with two Arabidopsis small GTP-binding proteins, Rop1 and Ran2. However, PHIP1 interacted only with the GTP-bound form of Rop1 but not the GDP-bound form. It also binds specifically to Ran2 mRNA.


Pssm-ID: 409714 [Multi-domain]  Cd Length: 72  Bit Score: 36.92  E-value: 2.36e-03
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1034628471 364 IYVQGLNDSVTLDDLADFFKQCGVVKMnkrtgqpmIHIYLDKETGKPKGDATVSYEDPPTAKAAVEW 430
Cdd:cd12271     1 VYVGGIPYYSTEAEIRSYFSSCGEVRS--------VDLMRFPDSGNFRGIAFITFKTEEAAKRALAL 59
RRM3_Prp24 cd12298
RNA recognition motif 3 in fungal pre-messenger RNA splicing protein 24 (Prp24) and similar ...
364-429 2.46e-03

RNA recognition motif 3 in fungal pre-messenger RNA splicing protein 24 (Prp24) and similar proteins; This subfamily corresponds to the RRM3 of Prp24, also termed U4/U6 snRNA-associated-splicing factor PRP24 (U4/U6 snRNP), an RNA-binding protein with four well conserved RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains). It facilitates U6 RNA base-pairing with U4 RNA during spliceosome assembly. Prp24 specifically binds free U6 RNA primarily with RRMs 1 and 2 and facilitates pairing of U6 RNA bases with U4 RNA bases. Additionally, it may also be involved in dissociation of the U4/U6 complex during spliceosome activation.


Pssm-ID: 409739 [Multi-domain]  Cd Length: 78  Bit Score: 37.24  E-value: 2.46e-03
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1034628471 364 IYVQGLNDSVTLDDLADFFKQCGVVKmnkrtgqpMIHIYLDKETGKPK---GDATVSYEDPPTAKAAVE 429
Cdd:cd12298     3 IRVRNLDFELDEEALRGIFEKFGEIE--------SINIPKKQKNRKGRhnnGFAFVTFEDADSAESALQ 63
PABP-1234 TIGR01628
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ...
120-213 2.52e-03

polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consists of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNA's from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed and of unknown tissue range.


Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 40.95  E-value: 2.52e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034628471 120 GTQPAYPAYGQQPAATAPTRPQ--DGNKPTETSQ-PQSSTG--GYNQPSLGYGQSNySYPQVPGSYPMQPVTAPPSYppT 194
Cdd:TIGR01628 393 GAMGQPPYYGQGPQQQFNGQPLgwPRMSMMPTPMgPGGPLRpnGLAPMNAVRAPSR-NAQNAAQKPPMQPVMYPPNY--Q 469
                          90
                  ....*....|....*....
gi 1034628471 195 SYSSTQPTSYDQSSYSQQN 213
Cdd:TIGR01628 470 SLPLSQDLPQPQSTASQGG 488
RRM_II_PABPN1L cd12551
RNA recognition motif in vertebrate type II embryonic polyadenylate-binding protein 2 (ePABP-2) ...
363-429 2.97e-03

RNA recognition motif in vertebrate type II embryonic polyadenylate-binding protein 2 (ePABP-2); This subgroup corresponds to the RRM of ePABP-2, also termed embryonic poly(A)-binding protein 2, or poly(A)-binding protein nuclear-like 1 (PABPN1L). ePABP-2 is a novel embryonic-specific cytoplasmic type II poly(A)-binding protein that is expressed during the early stages of vertebrate development and in adult ovarian tissue. It may play an important role in the poly(A) metabolism of stored mRNAs during early vertebrate development. ePABP-2 shows significant sequence similarity to the ubiquitously expressed nuclear polyadenylate-binding protein 2 (PABP-2 or PABPN1). Like PABP-2, ePABP-2 contains one RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain), which is responsible for the poly(A) binding. In addition, it possesses an acidic N-terminal domain predicted to form a coiled-coil and an arginine-rich C-terminal domain.


Pssm-ID: 409967 [Multi-domain]  Cd Length: 77  Bit Score: 36.73  E-value: 2.97e-03
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1034628471 363 AIYVQGLNDSVTLDDLADFFKQCGVVkmNKRTgqpmihIYLDKETGKPKGDATVSYEDPPTAKAAVE 429
Cdd:cd12551     1 SVYVGNVDYGSTADELEAHFNGCGPI--NRVT------ILCDKFSGHPKGYAYIEFATRSSVQAAVA 59
RRM5_RBM12_like cd12515
RNA recognition motif 5 (RRM5) found in RNA-binding protein RBM12, RBM12B and similar proteins; ...
366-429 3.35e-03

RNA recognition motif 5 (RRM5) found in RNA-binding protein RBM12, RBM12B and similar proteins; This subfamily corresponds to the RRM5 of RBM12 and RBM12B. RBM12, also termed SH3/WW domain anchor protein in the nucleus (SWAN), is ubiquitously expressed. It contains five distinct RNA binding motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains), two proline-rich regions, and several putative transmembrane domains. RBM12B show high sequence semilarity with RBM12. It contains five distinct RRMs as well. The biological roles of both RBM12 and RBM12B remain unclear.


Pssm-ID: 409937 [Multi-domain]  Cd Length: 75  Bit Score: 36.82  E-value: 3.35e-03
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1034628471 366 VQGLNDSVTLDDLADFFKQCGVVKmnkrtGQPMIHiYLDKetGKPKGDATVSYEDPPTAKAAVE 429
Cdd:cd12515     5 MRNLPFKATIEDILDFFYGYRVIP-----DSVSIR-YNDD--GQPTGDARVAFPSPREARRAVR 60
PABP-1234 TIGR01628
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ...
364-428 3.41e-03

polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consists of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNA's from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed and of unknown tissue range.


Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 40.56  E-value: 3.41e-03
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1034628471 364 IYVQGLNDSVTLDDLADFFKQCGVVKMNKrtgqpmihIYLDkETGKPKGDATVSYEDPPTAKAAV 428
Cdd:TIGR01628 288 LYVKNLDDTVTDEKLRELFSECGEITSAK--------VMLD-EKGVSRGFGFVCFSNPEEANRAV 343
RRM_CFIm68_CFIm59 cd12372
RNA recognition motif (RRM) found in pre-mRNA cleavage factor Im 68 kDa subunit (CFIm68 or ...
364-431 4.12e-03

RNA recognition motif (RRM) found in pre-mRNA cleavage factor Im 68 kDa subunit (CFIm68 or CPSF6), pre-mRNA cleavage factor Im 59 kDa subunit (CFIm59 or CPSF7), and similar proteins; This subfamily corresponds to the RRM of cleavage factor Im (CFIm) subunits. Cleavage factor Im (CFIm) is a highly conserved component of the eukaryotic mRNA 3' processing machinery that functions in UGUA-mediated poly(A) site recognition, the regulation of alternative poly(A) site selection, mRNA export, and mRNA splicing. It is a complex composed of a small 25 kDa (CFIm25) subunit and a larger 59/68/72 kDa subunit. Two separate genes, CPSF6 and CPSF7, code for two isoforms of the large subunit, CFIm68 and CFIm59. Structurally related CFIm68 and CFIm59, also termed cleavage and polyadenylation specificity factor subunit 6 (CPSF7), or cleavage and polyadenylation specificity factor 59 kDa subunit (CPSF59), are functionally redundant. Both contains an N-terminal RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain), a central proline-rich region, and a C-terminal RS-like domain. Their N-terminal RRM mediates the interaction with CFIm25, and also serves to enhance RNA binding and facilitate RNA looping.


Pssm-ID: 409807 [Multi-domain]  Cd Length: 76  Bit Score: 36.52  E-value: 4.12e-03
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1034628471 364 IYVQGLNDSVTLDDLADFFKQCGVVKMNKrtgqpmIHIYLDKETGKPKGDATVSYEDPPTAKAAVEWF 431
Cdd:cd12372     1 LYVGNLQWWTTDEDLEGACASFGVVDVKE------IKFFEHKANGKSKGYAYVEFASPAAAAAVKEKL 62
RRM1_SF3B4 cd12334
RNA recognition motif 1 (RRM1) found in splicing factor 3B subunit 4 (SF3B4) and similar ...
364-428 4.40e-03

RNA recognition motif 1 (RRM1) found in splicing factor 3B subunit 4 (SF3B4) and similar proteins; This subfamily corresponds to the RRM1 of SF3B4, also termed pre-mRNA-splicing factor SF3b 49 kDa (SF3b50), or spliceosome-associated protein 49 (SAP 49). SF3B4 a component of the multiprotein complex splicing factor 3b (SF3B), an integral part of the U2 small nuclear ribonucleoprotein (snRNP) and the U11/U12 di-snRNP. SF3B is essential for the accurate excision of introns from pre-messenger RNA, and is involved in the recognition of the pre-mRNA's branch site within the major and minor spliceosomes. SF3B4 functions to tether U2 snRNP with pre-mRNA at the branch site during spliceosome assembly. It is an evolutionarily highly conserved protein with orthologs across diverse species. SF3B4 contains two closely adjacent N-terminal RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains). It binds directly to pre-mRNA and also interacts directly and highly specifically with another SF3B subunit called SAP 145.


Pssm-ID: 409771 [Multi-domain]  Cd Length: 74  Bit Score: 36.43  E-value: 4.40e-03
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1034628471 364 IYVQGLNDSVTLDDLADFFKQCG-VVKmnkrtgqpmIHIYLDKETGKPKGDATVSYEDPPTAKAAV 428
Cdd:cd12334     1 VYVGNLDEKVTEELLWELFIQAGpVVN---------VHMPKDRVTQQHQGYGFVEFLSEEDADYAI 57
RRM2_Nop13p_fungi cd12397
RNA recognition motif 2 (RRM2) found in yeast nucleolar protein 13 (Nop13p) and similar ...
364-429 4.67e-03

RNA recognition motif 2 (RRM2) found in yeast nucleolar protein 13 (Nop13p) and similar proteins; This subfamily corresponds to the RRM2 of Nop13p encoded by YNL175c from Saccharomyces cerevisiae. It shares high sequence similarity with nucleolar protein 12 (Nop12p). Both Nop12p and Nop13p are not essential for growth. However, unlike Nop12p that is localized to the nucleolus, Nop13p localizes primarily to the nucleolus but is also present in the nucleoplasm to a lesser extent. Nop13p contains two RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains).


Pssm-ID: 409831 [Multi-domain]  Cd Length: 76  Bit Score: 36.27  E-value: 4.67e-03
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1034628471 364 IYVQGLNDSVTLDDLADFFKQCGVVKMnkrtgqpmIHIYLDKETGKPKGDATVSYEDPPTAKAAVE 429
Cdd:cd12397     1 LFVGNLSFETTEEDLRKHFAPAGKIRK--------VRMATFEDSGKCKGFAFVDFKEIESATNAVK 58
RRM2_RBM40_like cd12239
RNA recognition motif 2 (RRM2) found in RNA-binding protein 40 (RBM40) and similar proteins; ...
361-429 5.08e-03

RNA recognition motif 2 (RRM2) found in RNA-binding protein 40 (RBM40) and similar proteins; This subfamily corresponds to the RRM2 of RBM40 and the RRM of RBM41. RBM40, also known as RNA-binding region-containing protein 3 (RNPC3) or U11/U12 small nuclear ribonucleoprotein 65 kDa protein (U11/U12-65K protein). It serves as a bridging factor between the U11 and U12 snRNPs. It contains two RNA recognition motifs (RRMs), also known as RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains), connected by a linker that includes a proline-rich region. It binds to the U11-associated 59K protein via its RRM1 and employs the RRM2 to bind hairpin III of the U12 small nuclear RNA (snRNA). The proline-rich region might be involved in protein-protein interactions. RBM41 contains only one RRM. Its biological function remains unclear.


Pssm-ID: 409685 [Multi-domain]  Cd Length: 82  Bit Score: 36.44  E-value: 5.08e-03
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1034628471 361 NSAIYVQGLNDSVTLDDLADFFkqcGVVKMNKRTGQPMIHIYLDKEtGKPKGDATVSYEDPPTAKAAVE 429
Cdd:cd12239     1 SNRLYVKNLSKRVSEKDLKYIF---GRFVDSSSEEKNMFDIRLMTE-GRMKGQAFITFPSEELAEKALN 65
RRM_CIRBP_RBM3 cd12449
RNA recognition motif (RRM) found in cold inducible RNA binding protein (CIRBP), RNA binding ...
364-429 6.46e-03

RNA recognition motif (RRM) found in cold inducible RNA binding protein (CIRBP), RNA binding motif protein 3 (RBM3) and similar proteins; This subfamily corresponds to the RRM domain of two structurally related heterogenous nuclear ribonucleoproteins, CIRBP (also termed CIRP or A18 hnRNP) and RBM3 (also termed RNPL), both of which belong to a highly conserved cold shock proteins family. The cold shock proteins can be induced after exposure to a moderate cold-shock and other cellular stresses such as UV radiation and hypoxia. CIRBP and RBM3 may function in posttranscriptional regulation of gene expression by binding to different transcripts, thus allowing the cell to response rapidly to environmental signals. However, the kinetics and degree of cold induction are different between CIRBP and RBM3. Tissue distribution of their expression is different. CIRBP and RBM3 may be differentially regulated under physiological and stress conditions and may play distinct roles in cold responses of cells. CIRBP, also termed glycine-rich RNA-binding protein CIRP, is localized in the nucleus and mediates the cold-induced suppression of cell cycle progression. CIRBP also binds DNA and possibly serves as a chaperone that assists in the folding/unfolding, assembly/disassembly and transport of various proteins. RBM3 may enhance global protein synthesis and the formation of active polysomes while reducing the levels of ribonucleoprotein complexes containing microRNAs. RBM3 may also serve to prevent the loss of muscle mass by its ability to decrease cell death. Furthermore, RBM3 may be essential for cell proliferation and mitosis. Both, CIRBP and RBM3, contain an N-terminal RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain), that is involved in RNA binding, and C-terminal glycine-rich domain (RGG motif) that probably enhances RNA-binding via protein-protein and/or protein-RNA interactions. Like CIRBP, RBM3 can also bind to both RNA and DNA via its RRM domain.


Pssm-ID: 409883 [Multi-domain]  Cd Length: 80  Bit Score: 35.92  E-value: 6.46e-03
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1034628471 364 IYVQGLNDSVTLDDLADFFKQCGVVKMnkrtgqpmIHIYLDKETGKPKGDATVSYEDPPTAKAAVE 429
Cdd:cd12449     3 LFVGGLSFDTNEQSLEEVFSKYGQISE--------VVVVKDRETQRSRGFGFVTFENPDDAKDAMM 60
RRM1_hnRNPA_hnRNPD_like cd12325
RNA recognition motif 1 (RRM1) found in heterogeneous nuclear ribonucleoprotein hnRNP A and ...
364-429 6.77e-03

RNA recognition motif 1 (RRM1) found in heterogeneous nuclear ribonucleoprotein hnRNP A and hnRNP D subfamilies and similar proteins; This subfamily corresponds to the RRM1 in the hnRNP A subfamily which includes hnRNP A0, hnRNP A1, hnRNP A2/B1, hnRNP A3 and similar proteins. hnRNP A0 is a low abundance hnRNP protein that has been implicated in mRNA stability in mammalian cells. hnRNP A1 is an abundant eukaryotic nuclear RNA-binding protein that may modulate splice site selection in pre-mRNA splicing. hnRNP A2/B1 is an RNA trafficking response element-binding protein that interacts with the hnRNP A2 response element (A2RE). hnRNP A3 is also a RNA trafficking response element-binding protein that participates in the trafficking of A2RE-containing RNA. The hnRNP A subfamily is characterized by two RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains), followed by a long glycine-rich region at the C-terminus. The hnRNP D subfamily includes hnRNP D0, hnRNP A/B, hnRNP DL and similar proteins. hnRNP D0 is a UUAG-specific nuclear RNA binding protein that may be involved in pre-mRNA splicing and telomere elongation. hnRNP A/B is an RNA unwinding protein with a high affinity for G- followed by U-rich regions. hnRNP A/B has also been identified as an APOBEC1-binding protein that interacts with apolipoprotein B (apoB) mRNA transcripts around the editing site and thus, plays an important role in apoB mRNA editing. hnRNP DL (or hnRNP D-like) is a dual functional protein that possesses DNA- and RNA-binding properties. It has been implicated in mRNA biogenesis at the transcriptional and post-transcriptional levels. All members in this subfamily contain two putative RRMs and a glycine- and tyrosine-rich C-terminus. The family also contains DAZAP1 (Deleted in azoospermia-associated protein 1), RNA-binding protein Musashi homolog Musashi-1, Musashi-2 and similar proteins. They all harbor two RRMs.


Pssm-ID: 409763 [Multi-domain]  Cd Length: 72  Bit Score: 35.58  E-value: 6.77e-03
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1034628471 364 IYVQGLNDSVTLDDLADFFKQCG-----VVKMnkrtgqpmihiylDKETGKPKGDATVSYEDPPTAKAAVE 429
Cdd:cd12325     1 LFVGGLSWETTEESLREYFSKYGevvdcVVMK-------------DPATGRSRGFGFVTFKDPSSVDAVLA 58
RRM1_La cd12291
RNA recognition motif 1 in La autoantigen (La or LARP3) and similar proteins; This subfamily ...
364-429 9.61e-03

RNA recognition motif 1 in La autoantigen (La or LARP3) and similar proteins; This subfamily corresponds to the RRM1 of La autoantigen, also termed Lupus La protein, or La ribonucleoprotein, or Sjoegren syndrome type B antigen (SS-B), a highly abundant nuclear phosphoprotein and well conserved in eukaryotes. It specifically binds the 3'-terminal UUU-OH motif of nascent RNA polymerase III transcripts and protects them from exonucleolytic degradation by 3' exonucleases. In addition, La can directly facilitate the translation and/or metabolism of many UUU-3' OH-lacking cellular and viral mRNAs, through binding internal RNA sequences within the untranslated regions of target mRNAs. La contains an N-terminal La motif (LAM), followed by two RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains). It also possesses a short basic motif (SBM) and a nuclear localization signal (NLS) at the C-terminus.


Pssm-ID: 409733 [Multi-domain]  Cd Length: 73  Bit Score: 35.26  E-value: 9.61e-03
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1034628471 364 IYVQGLNDSVTLDDLADFFKQCGVVkmnkrtgqpmIHIYL--DKETGKPKGDATVSYEDPPTAKAAVE 429
Cdd:cd12291     2 VYVKGFPLDATLDDIQEFFEKKGKV----------ENVRMrrDLDSKEFKGSVFVEFKTEEEAKKFLE 59
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH