NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|75200774|sp|Q9SAJ5|]
View 

RecName: Full=Pentatricopeptide repeat-containing protein At1g79540

Protein Classification

pentatricopeptide repeat-containing protein( domain architecture ID 1015190)

pentatricopeptide repeat (PPR)-containing protein may form anti-parallel alpha helices and bind single-stranded RNA in a sequence-specific and modular manner; similar to Arabidopsis pentatricopeptide repeat proteins that play constitutive, often essential roles in mitochondria and chloroplasts, probably via binding to organellar transcripts

Gene Ontology:  GO:0003723
PubMed:  23635770

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
PLN03218 super family cl33664
maturation of RBCL 1; Provisional
181-559 8.45e-28

maturation of RBCL 1; Provisional


The actual alignment was detected with superfamily member PLN03218:

Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 120.75  E-value: 8.45e-28
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 75200774   181 LAFAVYNEMLKCNCSPNLYTFGILMDGLYKKGRTSDAQKMFDDMTGRGISPNRVTYTILISGLCQRGSADDARKLFYEM- 259
Cdd:PLN03218  490 AMFEVFHEMVNAGVEANVHTFGALIDGCARAGQVAKAFGAYGIMRSKNVKPDRVVFNALISACGQSGAVDRAFDVLAEMk 569
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 75200774   260 -QTSGNYPDSVAHNALLDGFCKLGRmVEafellrlfekdgfvlglrgysslidglfRARrytqafELYANMLKKNIKPDI 338
Cdd:PLN03218  570 aETHPIDPDHITVGALMKACANAGQ-VD----------------------------RAK------EVYQMIHEYNIKGTP 614
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 75200774   339 ILYTILIQGLSKAGKIEDALKLLSSMPSKGISPDTYCYNAVIKALCGRGLLEEGRSLQLEMSETESFPDACTHTILICSM 418
Cdd:PLN03218  615 EVYTIAVNSCSQKGDWDFALSIYDDMKKKGVKPDEVFFSALVDVAGHAGDLDKAFEILQDARKQGIKLGTVSYSSLMGAC 694
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 75200774   419 CRNGLVREAEEIFTEIEKSGCSPSVATFNALIDGLCksgelkEARLLLHKMEVgrpaslflrlshsgnrsfdtmvesgsi 498
Cdd:PLN03218  695 SNAKNWKKALELYEDIKSIKLRPTVSTMNALITALC------EGNQLPKALEV--------------------------- 741
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 75200774   499 lkayrdLAHFADTGSSPDIVSYNVLINGFCRAGDIDGALKLLNVLQLKGLSPDSVTYNTLI 559
Cdd:PLN03218  742 ------LSEMKRLGLCPNTITYSILLVASERKDDADVGLDLLSQAKEDGIKPNLVMCRCIT 796
PLN03218 super family cl33664
maturation of RBCL 1; Provisional
421-712 6.49e-09

maturation of RBCL 1; Provisional


The actual alignment was detected with superfamily member PLN03218:

Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 59.51  E-value: 6.49e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 75200774   421 NGLVREAEEIFTEIEKSGCSP--SVATFNALIDGLCKSGELKEARLLLHKMEVGRPASLfLRLSHSgnRSFDTMVESGSI 498
Cdd:PLN03218  346 QAASDVEEENSLAAYNGGVSGkrKSPEYIDAYNRLLRDGRIKDCIDLLEDMEKRGLLDM-DKIYHA--KFFKACKKQRAV 422
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 75200774   499 LKAYRdlahFADTGSSPDIVSYNVLINGFCRAGDIDGALKLLNVLQLKGLSPDSVTYNTLINGLHRVGREEEAFKLFYAK 578
Cdd:PLN03218  423 KEAFR----FAKLIRNPTLSTFNMLMSVCASSQDIDGALRVLRLVQEAGLKADCKLYTTLISTCAKSGKVDAMFEVFHEM 498
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 75200774   579 DDFRHSPAV--YRSLMTWSCRKRKVLVAFNLWMKYLKKISCLDDETANE-IEQCFKEGETERALRRLIELDTRK-----D 650
Cdd:PLN03218  499 VNAGVEANVhtFGALIDGCARAGQVAKAFGAYGIMRSKNVKPDRVVFNAlISACGQSGAVDRAFDVLAEMKAEThpidpD 578
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 75200774   651 ELTLGPytiwLIGLC-QSGRFHEALMVFSVLREKKILVTPPSCVKLIHGLCKREQLDAAIEVF 712
Cdd:PLN03218  579 HITVGA----LMKACaNAGQVDRAKEVYQMIHEYNIKGTPEVYTIAVNSCSQKGDWDFALSIY 637
PPR_3 super family cl37870
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
115-171 7.11e-07

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


The actual alignment was detected with superfamily member pfam13812:

Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 46.97  E-value: 7.11e-07
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 75200774   115 LEELKSGGVSVDSYCFCVLISAYAKMGMAEKAVESFGRMKEFDCRPDVFTYNVILRV 171
Cdd:pfam13812   3 LREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILGV 59
 
Name Accession Description Interval E-value
PLN03218 PLN03218
maturation of RBCL 1; Provisional
181-559 8.45e-28

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 120.75  E-value: 8.45e-28
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 75200774   181 LAFAVYNEMLKCNCSPNLYTFGILMDGLYKKGRTSDAQKMFDDMTGRGISPNRVTYTILISGLCQRGSADDARKLFYEM- 259
Cdd:PLN03218  490 AMFEVFHEMVNAGVEANVHTFGALIDGCARAGQVAKAFGAYGIMRSKNVKPDRVVFNALISACGQSGAVDRAFDVLAEMk 569
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 75200774   260 -QTSGNYPDSVAHNALLDGFCKLGRmVEafellrlfekdgfvlglrgysslidglfRARrytqafELYANMLKKNIKPDI 338
Cdd:PLN03218  570 aETHPIDPDHITVGALMKACANAGQ-VD----------------------------RAK------EVYQMIHEYNIKGTP 614
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 75200774   339 ILYTILIQGLSKAGKIEDALKLLSSMPSKGISPDTYCYNAVIKALCGRGLLEEGRSLQLEMSETESFPDACTHTILICSM 418
Cdd:PLN03218  615 EVYTIAVNSCSQKGDWDFALSIYDDMKKKGVKPDEVFFSALVDVAGHAGDLDKAFEILQDARKQGIKLGTVSYSSLMGAC 694
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 75200774   419 CRNGLVREAEEIFTEIEKSGCSPSVATFNALIDGLCksgelkEARLLLHKMEVgrpaslflrlshsgnrsfdtmvesgsi 498
Cdd:PLN03218  695 SNAKNWKKALELYEDIKSIKLRPTVSTMNALITALC------EGNQLPKALEV--------------------------- 741
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 75200774   499 lkayrdLAHFADTGSSPDIVSYNVLINGFCRAGDIDGALKLLNVLQLKGLSPDSVTYNTLI 559
Cdd:PLN03218  742 ------LSEMKRLGLCPNTITYSILLVASERKDDADVGLDLLSQAKEDGIKPNLVMCRCIT 796
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
515-564 1.96e-16

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 73.55  E-value: 1.96e-16
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 75200774   515 PDIVSYNVLINGFCRAGDIDGALKLLNVLQLKGLSPDSVTYNTLINGLHR 564
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PLN03218 PLN03218
maturation of RBCL 1; Provisional
421-712 6.49e-09

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 59.51  E-value: 6.49e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 75200774   421 NGLVREAEEIFTEIEKSGCSP--SVATFNALIDGLCKSGELKEARLLLHKMEVGRPASLfLRLSHSgnRSFDTMVESGSI 498
Cdd:PLN03218  346 QAASDVEEENSLAAYNGGVSGkrKSPEYIDAYNRLLRDGRIKDCIDLLEDMEKRGLLDM-DKIYHA--KFFKACKKQRAV 422
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 75200774   499 LKAYRdlahFADTGSSPDIVSYNVLINGFCRAGDIDGALKLLNVLQLKGLSPDSVTYNTLINGLHRVGREEEAFKLFYAK 578
Cdd:PLN03218  423 KEAFR----FAKLIRNPTLSTFNMLMSVCASSQDIDGALRVLRLVQEAGLKADCKLYTTLISTCAKSGKVDAMFEVFHEM 498
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 75200774   579 DDFRHSPAV--YRSLMTWSCRKRKVLVAFNLWMKYLKKISCLDDETANE-IEQCFKEGETERALRRLIELDTRK-----D 650
Cdd:PLN03218  499 VNAGVEANVhtFGALIDGCARAGQVAKAFGAYGIMRSKNVKPDRVVFNAlISACGQSGAVDRAFDVLAEMKAEThpidpD 578
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 75200774   651 ELTLGPytiwLIGLC-QSGRFHEALMVFSVLREKKILVTPPSCVKLIHGLCKREQLDAAIEVF 712
Cdd:PLN03218  579 HITVGA----LMKACaNAGQVDRAKEVYQMIHEYNIKGTPEVYTIAVNSCSQKGDWDFALSIY 637
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
546-575 1.94e-08

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 50.42  E-value: 1.94e-08
                          10        20        30
                  ....*....|....*....|....*....|
gi 75200774   546 KGLSPDSVTYNTLINGLHRVGREEEAFKLF 575
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELL 30
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
115-171 7.11e-07

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 46.97  E-value: 7.11e-07
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 75200774   115 LEELKSGGVSVDSYCFCVLISAYAKMGMAEKAVESFGRMKEFDCRPDVFTYNVILRV 171
Cdd:pfam13812   3 LREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILGV 59
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
339-373 2.93e-06

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 44.37  E-value: 2.93e-06
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 75200774   339 ILYTILIQGLSKAGKIEDALKLLSSMPSKGISPDT 373
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
128-162 8.14e-04

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 37.43  E-value: 8.14e-04
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 75200774   128 YCFCVLISAYAKMGMAEKAVESFGRMKEFDCRPDV 162
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
 
Name Accession Description Interval E-value
PLN03218 PLN03218
maturation of RBCL 1; Provisional
181-559 8.45e-28

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 120.75  E-value: 8.45e-28
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 75200774   181 LAFAVYNEMLKCNCSPNLYTFGILMDGLYKKGRTSDAQKMFDDMTGRGISPNRVTYTILISGLCQRGSADDARKLFYEM- 259
Cdd:PLN03218  490 AMFEVFHEMVNAGVEANVHTFGALIDGCARAGQVAKAFGAYGIMRSKNVKPDRVVFNALISACGQSGAVDRAFDVLAEMk 569
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 75200774   260 -QTSGNYPDSVAHNALLDGFCKLGRmVEafellrlfekdgfvlglrgysslidglfRARrytqafELYANMLKKNIKPDI 338
Cdd:PLN03218  570 aETHPIDPDHITVGALMKACANAGQ-VD----------------------------RAK------EVYQMIHEYNIKGTP 614
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 75200774   339 ILYTILIQGLSKAGKIEDALKLLSSMPSKGISPDTYCYNAVIKALCGRGLLEEGRSLQLEMSETESFPDACTHTILICSM 418
Cdd:PLN03218  615 EVYTIAVNSCSQKGDWDFALSIYDDMKKKGVKPDEVFFSALVDVAGHAGDLDKAFEILQDARKQGIKLGTVSYSSLMGAC 694
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 75200774   419 CRNGLVREAEEIFTEIEKSGCSPSVATFNALIDGLCksgelkEARLLLHKMEVgrpaslflrlshsgnrsfdtmvesgsi 498
Cdd:PLN03218  695 SNAKNWKKALELYEDIKSIKLRPTVSTMNALITALC------EGNQLPKALEV--------------------------- 741
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 75200774   499 lkayrdLAHFADTGSSPDIVSYNVLINGFCRAGDIDGALKLLNVLQLKGLSPDSVTYNTLI 559
Cdd:PLN03218  742 ------LSEMKRLGLCPNTITYSILLVASERKDDADVGLDLLSQAKEDGIKPNLVMCRCIT 796
PLN03218 PLN03218
maturation of RBCL 1; Provisional
231-599 2.31e-26

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 116.13  E-value: 2.31e-26
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 75200774   231 PNRVTYTILISGLCQRGSADDARKLFYEMQTSGNYPDSVAHNALLDGFCKLGRMVEAFELLRLFEKDGFVLGLRGYSSLI 310
Cdd:PLN03218  435 PTLSTFNMLMSVCASSQDIDGALRVLRLVQEAGLKADCKLYTTLISTCAKSGKVDAMFEVFHEMVNAGVEANVHTFGALI 514
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 75200774   311 DGLFRARRYTQAFELYANMLKKNIKPDIILYTILIQGLSKAGKIEDALKLLSSM--PSKGISPDTYCYNAVIKALCGRGL 388
Cdd:PLN03218  515 DGCARAGQVAKAFGAYGIMRSKNVKPDRVVFNALISACGQSGAVDRAFDVLAEMkaETHPIDPDHITVGALMKACANAGQ 594
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 75200774   389 LEEGRSLQLEMSE--TESFPDACTHTILICSmcRNGLVREAEEIFTEIEKSGCSPSVATFNALIDGLCKSGELKEARLLL 466
Cdd:PLN03218  595 VDRAKEVYQMIHEynIKGTPEVYTIAVNSCS--QKGDWDFALSIYDDMKKKGVKPDEVFFSALVDVAGHAGDLDKAFEIL 672
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 75200774   467 HKMEvgrpaSLFLRLshsGNRSFDTMVESGSILKAYRD-LAHFADTGSS---PDIVSYNVLINGFCRAGDIDGALKLLNV 542
Cdd:PLN03218  673 QDAR-----KQGIKL---GTVSYSSLMGACSNAKNWKKaLELYEDIKSIklrPTVSTMNALITALCEGNQLPKALEVLSE 744
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 75200774   543 LQLKGLSPDSVTYNTLINGLHRVGREEEAFKLF-YAKDD-FRHSPAVYRSLmTWSCRKR 599
Cdd:PLN03218  745 MKRLGLCPNTITYSILLVASERKDDADVGLDLLsQAKEDgIKPNLVMCRCI-TGLCLRR 802
PLN03218 PLN03218
maturation of RBCL 1; Provisional
109-382 1.07e-22

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 104.19  E-value: 1.07e-22
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 75200774   109 DLYWQTLEELKSGGVSVDSYCFCVLISAYAKMGMAEKAVESFGRMKEFDCRPDVFTYNVILRVMMREEVFfMLAFAVYNE 188
Cdd:PLN03218  489 DAMFEVFHEMVNAGVEANVHTFGALIDGCARAGQVAKAFGAYGIMRSKNVKPDRVVFNALISACGQSGAV-DRAFDVLAE 567
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 75200774   189 MlKCNCSP---NLYTFGILMDGLYKKGRTSDAQKMFDDMTGRGISPNRVTYTILISGLCQRGSADDARKLFYEMQTSGNY 265
Cdd:PLN03218  568 M-KAETHPidpDHITVGALMKACANAGQVDRAKEVYQMIHEYNIKGTPEVYTIAVNSCSQKGDWDFALSIYDDMKKKGVK 646
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 75200774   266 PDSVAHNALLDGFCKLGRMVEAFELLRLFEKDGFVLGLRGYSSLIDGLFRARRYTQAFELYANMLKKNIKPDIILYTILI 345
Cdd:PLN03218  647 PDEVFFSALVDVAGHAGDLDKAFEILQDARKQGIKLGTVSYSSLMGACSNAKNWKKALELYEDIKSIKLRPTVSTMNALI 726
                         250       260       270
                  ....*....|....*....|....*....|....*..
gi 75200774   346 QGLSKAGKIEDALKLLSSMPSKGISPDTYCYNAVIKA 382
Cdd:PLN03218  727 TALCEGNQLPKALEVLSEMKRLGLCPNTITYSILLVA 763
PLN03077 PLN03077
Protein ECB2; Provisional
126-780 4.43e-22

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 101.85  E-value: 4.43e-22
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 75200774  126 DSYCFCVLISAYAKMGMAEKAVESFGRMKEFDCRPDVFTYNVILR--------VMMREEVFFMLAFAVYNEMLKCNCSPN 197
Cdd:PLN03077 151 DLFSWNVLVGGYAKAGYFDEALCLYHRMLWAGVRPDVYTFPCVLRtcggipdlARGREVHAHVVRFGFELDVDVVNALIT 230
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 75200774  198 LYTfgilmdglyKKGRTSDAQKMFDDMTGRgispNRVTYTILISGLCQRGSADDARKLFYEMQTSGNYPD-----SVAHN 272
Cdd:PLN03077 231 MYV---------KCGDVVSARLVFDRMPRR----DCISWNAMISGYFENGECLEGLELFFTMRELSVDPDlmtitSVISA 297
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 75200774  273 ALLDGFCKLGRMVEAFELlrlfeKDGFVLGLRGYSSLIDGLFRARRYTQAFELYANMLKKnikpDIILYTILIQGLSKAG 352
Cdd:PLN03077 298 CELLGDERLGREMHGYVV-----KTGFAVDVSVCNSLIQMYLSLGSWGEAEKVFSRMETK----DAVSWTAMISGYEKNG 368
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 75200774  353 KIEDALKLLSSMPSKGISPDTYCYNAVIKALCGRGLLEEGRSLQlEMSETE---SFPDACTHTILICSMCRNglVREAEE 429
Cdd:PLN03077 369 LPDKALETYALMEQDNVSPDEITIASVLSACACLGDLDVGVKLH-ELAERKgliSYVVVANALIEMYSKCKC--IDKALE 445
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 75200774  430 IFTEIEKSgcspSVATFNALIDGLCKSGELKEARLLLHKMEVG-RPASLFLRLSHSG----------------------- 485
Cdd:PLN03077 446 VFHNIPEK----DVISWTSIIAGLRLNNRCFEALIFFRQMLLTlKPNSVTLIAALSAcarigalmcgkeihahvlrtgig 521
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 75200774  486 ------NRSFDTMVESGSILKAYrdlAHFadTGSSPDIVSYNVLINGFCRAGDIDGALKLLNVLQLKGLSPDSVTYNTLI 559
Cdd:PLN03077 522 fdgflpNALLDLYVRCGRMNYAW---NQF--NSHEKDVVSWNILLTGYVAHGKGSMAVELFNRMVESGVNPDEVTFISLL 596
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 75200774  560 NGLHRVGREEEAFKLFYA-KDDFRHSPAV--YRSLMTWSCRKRKVLVAFNlwmkYLKKISCLDDE-------TANEIEQC 629
Cdd:PLN03077 597 CACSRSGMVTQGLEYFHSmEEKYSITPNLkhYACVVDLLGRAGKLTEAYN----FINKMPITPDPavwgallNACRIHRH 672
                        570       580       590       600       610       620       630       640
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 75200774  630 FKEGETerALRRLIELDTRkdelTLGpYTIWLIGL-CQSGRFHEALMVFSVLREKKILVTPPscvklihglCKREQLDAA 708
Cdd:PLN03077 673 VELGEL--AAQHIFELDPN----SVG-YYILLCNLyADAGKWDEVARVRKTMRENGLTVDPG---------CSWVEVKGK 736
                        650       660       670       680       690       700       710
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 75200774  709 IEVFLYtlDNNFKLMPRVCNYLLSSLLESTEKMEIVSQLTNRMERagynvDSMLRFEILKYHRHRKQVLIDL 780
Cdd:PLN03077 737 VHAFLT--DDESHPQIKEINTVLEGFYEKMKASGLAGSESSSMDE-----IEVSKDDIFCGHSERLAIAFGL 801
PLN03081 PLN03081
pentatricopeptide (PPR) repeat-containing protein; Provisional
133-549 1.60e-16

pentatricopeptide (PPR) repeat-containing protein; Provisional


Pssm-ID: 215563 [Multi-domain]  Cd Length: 697  Bit Score: 83.77  E-value: 1.60e-16
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 75200774  133 LISAYAKMGMAEKAVESFGRMKEF--DCRPDVFTynvilrVMMREEVFFMLAFA---VYNEMLKCNCSPNLYTFGILMDG 207
Cdd:PLN03081 195 IIGGLVDAGNYREAFALFREMWEDgsDAEPRTFV------VMLRASAGLGSARAgqqLHCCVLKTGVVGDTFVSCALIDM 268
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 75200774  208 LYKKGRTSDAQKMFDDMTGRGIspnrVTYTILISGLCQRGSADDARKLFYEMQTSGNYPDSVAHNALLDGFCKLGRMVEA 287
Cdd:PLN03081 269 YSKCGDIEDARCVFDGMPEKTT----VAWNSMLAGYALHGYSEEALCLYYEMRDSGVSIDQFTFSIMIRIFSRLALLEHA 344
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 75200774  288 FELLRLFEKDGFVLGLRGYSSLIDGLFRARRYTQAFELYANMLKKNIkpdiILYTILIQGLSKAGKIEDALKLLSSMPSK 367
Cdd:PLN03081 345 KQAHAGLIRTGFPLDIVANTALVDLYSKWGRMEDARNVFDRMPRKNL----ISWNALIAGYGNHGRGTKAVEMFERMIAE 420
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 75200774  368 GISPDTYCYNAVIKALCGRGLLEEGRSLQLEMSETESF-PDACTHTILICSMCRNGLVREAeeiFTEIEKSGCSPSVATF 446
Cdd:PLN03081 421 GVAPNHVTFLAVLSACRYSGLSEQGWEIFQSMSENHRIkPRAMHYACMIELLGREGLLDEA---YAMIRRAPFKPTVNMW 497
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 75200774  447 NALIDGlCKsgelkearllLHK-MEVGRPASLFLrlshsgnrsfdtmvesgsilkayrdlahfadTGSSPD-IVSYNVLI 524
Cdd:PLN03081 498 AALLTA-CR----------IHKnLELGRLAAEKL-------------------------------YGMGPEkLNNYVVLL 535
                        410       420
                 ....*....|....*....|....*
gi 75200774  525 NGFCRAGDIDGALKLLNVLQLKGLS 549
Cdd:PLN03081 536 NLYNSSGRQAEAAKVVETLKRKGLS 560
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
515-564 1.96e-16

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 73.55  E-value: 1.96e-16
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 75200774   515 PDIVSYNVLINGFCRAGDIDGALKLLNVLQLKGLSPDSVTYNTLINGLHR 564
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
336-384 1.48e-14

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 68.16  E-value: 1.48e-14
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 75200774   336 PDIILYTILIQGLSKAGKIEDALKLLSSMPSKGISPDTYCYNAVIKALC 384
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLC 49
PLN03077 PLN03077
Protein ECB2; Provisional
112-436 1.73e-14

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 77.58  E-value: 1.73e-14
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 75200774  112 WQTLEELKSGGVSVDSYCFCVLISAYAKMGMAEKAVESFGRMKEFDCRPDVFTYNVILR-----------VMMRE---EV 177
Cdd:PLN03077 339 WGEAEKVFSRMETKDAVSWTAMISGYEKNGLPDKALETYALMEQDNVSPDEITIASVLSacaclgdldvgVKLHElaeRK 418
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 75200774  178 FFMLAFAVYNEML----KCNC------------SPNLYTFGILMDGLYKKGRTSDAQKMFDDMTGRgISPNRVTYTILIS 241
Cdd:PLN03077 419 GLISYVVVANALIemysKCKCidkalevfhnipEKDVISWTSIIAGLRLNNRCFEALIFFRQMLLT-LKPNSVTLIAALS 497
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 75200774  242 GLCQRGSADDARKLFYEMQTSGNYPDSVAHNALLDGFCKLGRMVEAFELLRLFEKDgfvlgLRGYSSLIDGLFRARRYTQ 321
Cdd:PLN03077 498 ACARIGALMCGKEIHAHVLRTGIGFDGFLPNALLDLYVRCGRMNYAWNQFNSHEKD-----VVSWNILLTGYVAHGKGSM 572
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 75200774  322 AFELYANMLKKNIKPDIILYTILIQGLSKAGKIEDALKLLSSMPSK-GISPDTYCYNAVIKALCGRGLLEEGRSLQLEMS 400
Cdd:PLN03077 573 AVELFNRMVESGVNPDEVTFISLLCACSRSGMVTQGLEYFHSMEEKySITPNLKHYACVVDLLGRAGKLTEAYNFINKMP 652
                        330       340       350       360
                 ....*....|....*....|....*....|....*....|
gi 75200774  401 ETesfPDACTHTILI--CSMCRNGLVRE--AEEIFtEIEK 436
Cdd:PLN03077 653 IT---PDPAVWGALLnaCRIHRHVELGElaAQHIF-ELDP 688
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
196-245 2.81e-14

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 67.39  E-value: 2.81e-14
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 75200774   196 PNLYTFGILMDGLYKKGRTSDAQKMFDDMTGRGISPNRVTYTILISGLCQ 245
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PLN03077 PLN03077
Protein ECB2; Provisional
207-702 3.38e-14

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 76.81  E-value: 3.38e-14
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 75200774  207 GLYKKGRTSDAQKMFDDMTGRGISPNRVTYTILISgLCQ-RGSADDARKLFYEMQTSGNYPDSVAHNALLDGFCKLGRMV 285
Cdd:PLN03077  60 ALCSHGQLEQALKLLESMQELRVPVDEDAYVALFR-LCEwKRAVEEGSRVCSRALSSHPSLGVRLGNAMLSMFVRFGELV 138
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 75200774  286 EAFELL-RLFEKDGFvlglrGYSSLIDGLFRARRYTQAFELYANMLKKNIKPDIilYTI--------------------- 343
Cdd:PLN03077 139 HAWYVFgKMPERDLF-----SWNVLVGGYAKAGYFDEALCLYHRMLWAGVRPDV--YTFpcvlrtcggipdlargrevha 211
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 75200774  344 ----------------LIQGLSKAGKIEDALKLLSSMPSKgispDTYCYNAVIKALCGRGLLEEGRSLQLEMSETESFPD 407
Cdd:PLN03077 212 hvvrfgfeldvdvvnaLITMYVKCGDVVSARLVFDRMPRR----DCISWNAMISGYFENGECLEGLELFFTMRELSVDPD 287
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 75200774  408 ACTHTILICSMCRNGLVREAEEIFTEIEKSGCSPSVATFNALIDGLCKSGELKEARLLLHKMEvgrpaslflrlshsgnr 487
Cdd:PLN03077 288 LMTITSVISACELLGDERLGREMHGYVVKTGFAVDVSVCNSLIQMYLSLGSWGEAEKVFSRME----------------- 350
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 75200774  488 sfdtmvesgsilkayrdlahfadtgsSPDIVSYNVLINGFCRAGDIDGALKLLNVLQLKGLSPDSVTYNTLINGLHRVGR 567
Cdd:PLN03077 351 --------------------------TKDAVSWTAMISGYEKNGLPDKALETYALMEQDNVSPDEITIASVLSACACLGD 404
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 75200774  568 EEEAFKLfyakddfrHSPAVYRSLMTWscrkrkVLVAFNLWMKYlKKISCLDdeTANEIEQCFKEgeteralrrlieldt 647
Cdd:PLN03077 405 LDVGVKL--------HELAERKGLISY------VVVANALIEMY-SKCKCID--KALEVFHNIPE--------------- 452
                        490       500       510       520       530
                 ....*....|....*....|....*....|....*....|....*....|....*..
gi 75200774  648 rKDELTlgpYTIWLIGLCQSGRFHEALMVFsvlrEKKILVTPPSCVKLIHGL--CKR 702
Cdd:PLN03077 453 -KDVIS---WTSIIAGLRLNNRCFEALIFF----RQMLLTLKPNSVTLIAALsaCAR 501
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
406-455 6.99e-13

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 63.54  E-value: 6.99e-13
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 75200774   406 PDACTHTILICSMCRNGLVREAEEIFTEIEKSGCSPSVATFNALIDGLCK 455
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PLN03081 PLN03081
pentatricopeptide (PPR) repeat-containing protein; Provisional
282-575 8.24e-13

pentatricopeptide (PPR) repeat-containing protein; Provisional


Pssm-ID: 215563 [Multi-domain]  Cd Length: 697  Bit Score: 71.83  E-value: 8.24e-13
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 75200774  282 GRMVEAFELLRLFEKDG-FVLGLRGYSSLIDGLFRARRYTQAFELYANMLKKNIKPDIILYTILIQGLSKAGKIEDALKL 360
Cdd:PLN03081 101 GRHREALELFEILEAGCpFTLPASTYDALVEACIALKSIRCVKAVYWHVESSGFEPDQYMMNRVLLMHVKCGMLIDARRL 180
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 75200774  361 LSSMPSKGIspdtYCYNAVIKALCGRGLLEEGRSLQLEMseTESFPDACTHTILIcsMCRN----GLVREAEEIFTEIEK 436
Cdd:PLN03081 181 FDEMPERNL----ASWGTIIGGLVDAGNYREAFALFREM--WEDGSDAEPRTFVV--MLRAsaglGSARAGQQLHCCVLK 252
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 75200774  437 SGCSPSVATFNALIDGLCKSGELKEARLLLHKME----VG---------------RPASLFLRLSHSG----NRSFDTMV 493
Cdd:PLN03081 253 TGVVGDTFVSCALIDMYSKCGDIEDARCVFDGMPekttVAwnsmlagyalhgyseEALCLYYEMRDSGvsidQFTFSIMI 332
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 75200774  494 ----ESGSILKAYRDLAHFADTGSSPDIVSYNVLINGFCRAGDIDGALKLLNVLQLKGLspdsVTYNTLINGLHRVGREE 569
Cdd:PLN03081 333 rifsRLALLEHAKQAHAGLIRTGFPLDIVANTALVDLYSKWGRMEDARNVFDRMPRKNL----ISWNALIAGYGNHGRGT 408

                 ....*.
gi 75200774  570 EAFKLF 575
Cdd:PLN03081 409 KAVEMF 414
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
231-280 1.10e-11

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 60.07  E-value: 1.10e-11
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 75200774   231 PNRVTYTILISGLCQRGSADDARKLFYEMQTSGNYPDSVAHNALLDGFCK 280
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
160-210 1.30e-11

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 60.07  E-value: 1.30e-11
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|.
gi 75200774   160 PDVFTYNVILRVMMREEVFfMLAFAVYNEMLKCNCSPNLYTFGILMDGLYK 210
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKV-EEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
306-350 4.44e-11

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 58.53  E-value: 4.44e-11
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*
gi 75200774   306 YSSLIDGLFRARRYTQAFELYANMLKKNIKPDIILYTILIQGLSK 350
Cdd:pfam13041   6 YNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
438-470 1.12e-09

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 53.89  E-value: 1.12e-09
                          10        20        30
                  ....*....|....*....|....*....|...
gi 75200774   438 GCSPSVATFNALIDGLCKSGELKEARLLLHKME 470
Cdd:pfam12854   2 GLKPDVVTYNTLINGLCRAGRVDEAFELLDEME 34
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
227-259 1.38e-09

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 53.89  E-value: 1.38e-09
                          10        20        30
                  ....*....|....*....|....*....|...
gi 75200774   227 RGISPNRVTYTILISGLCQRGSADDARKLFYEM 259
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLDEM 33
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
512-541 4.39e-09

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 52.35  E-value: 4.39e-09
                          10        20        30
                  ....*....|....*....|....*....|
gi 75200774   512 GSSPDIVSYNVLINGFCRAGDIDGALKLLN 541
Cdd:pfam12854   2 GLKPDVVTYNTLINGLCRAGRVDEAFELLD 31
PLN03218 PLN03218
maturation of RBCL 1; Provisional
421-712 6.49e-09

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 59.51  E-value: 6.49e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 75200774   421 NGLVREAEEIFTEIEKSGCSP--SVATFNALIDGLCKSGELKEARLLLHKMEVGRPASLfLRLSHSgnRSFDTMVESGSI 498
Cdd:PLN03218  346 QAASDVEEENSLAAYNGGVSGkrKSPEYIDAYNRLLRDGRIKDCIDLLEDMEKRGLLDM-DKIYHA--KFFKACKKQRAV 422
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 75200774   499 LKAYRdlahFADTGSSPDIVSYNVLINGFCRAGDIDGALKLLNVLQLKGLSPDSVTYNTLINGLHRVGREEEAFKLFYAK 578
Cdd:PLN03218  423 KEAFR----FAKLIRNPTLSTFNMLMSVCASSQDIDGALRVLRLVQEAGLKADCKLYTTLISTCAKSGKVDAMFEVFHEM 498
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 75200774   579 DDFRHSPAV--YRSLMTWSCRKRKVLVAFNLWMKYLKKISCLDDETANE-IEQCFKEGETERALRRLIELDTRK-----D 650
Cdd:PLN03218  499 VNAGVEANVhtFGALIDGCARAGQVAKAFGAYGIMRSKNVKPDRVVFNAlISACGQSGAVDRAFDVLAEMKAEThpidpD 578
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 75200774   651 ELTLGPytiwLIGLC-QSGRFHEALMVFSVLREKKILVTPPSCVKLIHGLCKREQLDAAIEVF 712
Cdd:PLN03218  579 HITVGA----LMKACaNAGQVDRAKEVYQMIHEYNIKGTPEVYTIAVNSCSQKGDWDFALSIY 637
PLN03081 PLN03081
pentatricopeptide (PPR) repeat-containing protein; Provisional
115-375 1.06e-08

pentatricopeptide (PPR) repeat-containing protein; Provisional


Pssm-ID: 215563 [Multi-domain]  Cd Length: 697  Bit Score: 58.73  E-value: 1.06e-08
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 75200774  115 LEELKSGGVSVDSYCFCVLISAYAKMGMAEKAVESFGRMKEFDCRPDVFTYNVIL-------RVMMREEVFFMLAfavyn 187
Cdd:PLN03081 313 YYEMRDSGVSIDQFTFSIMIRIFSRLALLEHAKQAHAGLIRTGFPLDIVANTALVdlyskwgRMEDARNVFDRMP----- 387
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 75200774  188 emlkcncSPNLYTFGILMDGLYKKGRTSDAQKMFDDMTGRGISPNRVTYTILISGLCQRGSADDARKLFYEM-QTSGNYP 266
Cdd:PLN03081 388 -------RKNLISWNALIAGYGNHGRGTKAVEMFERMIAEGVAPNHVTFLAVLSACRYSGLSEQGWEIFQSMsENHRIKP 460
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 75200774  267 DSVAHNALLDGFCKLGRMVEAFELLRlfeKDGFVLGLRGYSSLIDGLfRARRYTQAFELYANMLKKNIKPDIILYTILIQ 346
Cdd:PLN03081 461 RAMHYACMIELLGREGLLDEAYAMIR---RAPFKPTVNMWAALLTAC-RIHKNLELGRLAAEKLYGMGPEKLNNYVVLLN 536
                        250       260
                 ....*....|....*....|....*....
gi 75200774  347 GLSKAGKIEDALKLLSSMPSKGISPDTYC 375
Cdd:PLN03081 537 LYNSSGRQAEAAKVVETLKRKGLSMHPAC 565
PLN03077 PLN03077
Protein ECB2; Provisional
345-593 1.50e-08

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 58.32  E-value: 1.50e-08
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 75200774  345 IQGLSKAGKIEDALKLLSSMPSKGISPDTYCYNAVIKaLC--GRGLLEEGRSLQLEMSETESFPDACTHTILicSM-CRN 421
Cdd:PLN03077  58 LRALCSHGQLEQALKLLESMQELRVPVDEDAYVALFR-LCewKRAVEEGSRVCSRALSSHPSLGVRLGNAML--SMfVRF 134
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 75200774  422 GLVREAEEIFTEIEKSgcspSVATFNALIDGLCKSGELKEARLLLHKME-VGRPASLFlrlshsgnrSFDTMVES-GSI- 498
Cdd:PLN03077 135 GELVHAWYVFGKMPER----DLFSWNVLVGGYAKAGYFDEALCLYHRMLwAGVRPDVY---------TFPCVLRTcGGIp 201
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 75200774  499 -LKAYRDL-AHFADTGSSPDIVSYNVLINGFCRAGDIDGALKLLNVLQLKglspDSVTYNTLINGLHRVGREEEAFKLFY 576
Cdd:PLN03077 202 dLARGREVhAHVVRFGFELDVDVVNALITMYVKCGDVVSARLVFDRMPRR----DCISWNAMISGYFENGECLEGLELFF 277
                        250
                 ....*....|....*..
gi 75200774  577 AKDDFRHSPavyrSLMT 593
Cdd:PLN03077 278 TMRELSVDP----DLMT 290
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
546-575 1.94e-08

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 50.42  E-value: 1.94e-08
                          10        20        30
                  ....*....|....*....|....*....|
gi 75200774   546 KGLSPDSVTYNTLINGLHRVGREEEAFKLF 575
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELL 30
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
441-529 5.05e-08

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 49.67  E-value: 5.05e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 75200774   441 PSVATFNALIDGLCKSGELKEARLLLHKMEvgrpaslflrlshsgnrsfdtmvesgsilkayrdlahfaDTGSSPDIVSY 520
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMK---------------------------------------KRGVKPNVYTY 41

                  ....*....
gi 75200774   521 NVLINGFCR 529
Cdd:pfam13041  42 TILINGLCK 50
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
328-386 7.61e-08

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 49.66  E-value: 7.61e-08
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 75200774   328 NMLKKNIKPDIILYTILIQGLSKAGKIEDALKLLSSMPSKGISPDTYCYNAVIKALCGR 386
Cdd:pfam13812   5 EMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILGVIGGR 63
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
332-364 4.59e-07

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 46.57  E-value: 4.59e-07
                          10        20        30
                  ....*....|....*....|....*....|...
gi 75200774   332 KNIKPDIILYTILIQGLSKAGKIEDALKLLSSM 364
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLDEM 33
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
115-171 7.11e-07

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 46.97  E-value: 7.11e-07
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 75200774   115 LEELKSGGVSVDSYCFCVLISAYAKMGMAEKAVESFGRMKEFDCRPDVFTYNVILRV 171
Cdd:pfam13812   3 LREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILGV 59
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
126-170 9.66e-07

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 46.20  E-value: 9.66e-07
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*
gi 75200774   126 DSYCFCVLISAYAKMGMAEKAVESFGRMKEFDCRPDVFTYNVILR 170
Cdd:pfam13041   2 DVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILIN 46
PLN03218 PLN03218
maturation of RBCL 1; Provisional
109-246 1.23e-06

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 52.19  E-value: 1.23e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 75200774   109 DLYWQTLEELKSGGVSVDSYCFCVLISAYAKMGMAEKAVESFGRMKEFDCRPDVFTYNVILRVMMREEVFfMLAFAVYNE 188
Cdd:PLN03218  666 DKAFEILQDARKQGIKLGTVSYSSLMGACSNAKNWKKALELYEDIKSIKLRPTVSTMNALITALCEGNQL-PKALEVLSE 744
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 75200774   189 MLKCNCSPNLYTFGILMDGLYKKGRTSDAQKMFDDMTGRGISPNRVTYTILIsGLCQR 246
Cdd:PLN03218  745 MKRLGLCPNTITYSILLVASERKDDADVGLDLLSQAKEDGIKPNLVMCRCIT-GLCLR 801
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
405-435 1.66e-06

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 45.03  E-value: 1.66e-06
                          10        20        30
                  ....*....|....*....|....*....|.
gi 75200774   405 FPDACTHTILICSMCRNGLVREAEEIFTEIE 435
Cdd:pfam12854   4 KPDVVTYNTLINGLCRAGRVDEAFELLDEME 34
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
193-225 2.76e-06

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 44.26  E-value: 2.76e-06
                          10        20        30
                  ....*....|....*....|....*....|...
gi 75200774   193 NCSPNLYTFGILMDGLYKKGRTSDAQKMFDDMT 225
Cdd:pfam12854   2 GLKPDVVTYNTLINGLCRAGRVDEAFELLDEME 34
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
339-373 2.93e-06

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 44.37  E-value: 2.93e-06
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 75200774   339 ILYTILIQGLSKAGKIEDALKLLSSMPSKGISPDT 373
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
518-552 2.96e-06

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 44.37  E-value: 2.96e-06
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 75200774   518 VSYNVLINGFCRAGDIDGALKLLNVLQLKGLSPDS 552
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
182-289 5.22e-06

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 48.16  E-value: 5.22e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 75200774   182 AFAVYNEMLKCNCSPNLYTFGILMDGLYKKGRTSDAQK---------MFDDMTGRGISPNRVTYTILISGLCQRGSADDA 252
Cdd:pfam17177  30 ALALYDAAKAEGVRLAQYHYNVLLYLCSKAADATDLKPqlaadrgfeVFEAMKAQGVSPNEATYTAVARLAAAKGDGDLA 109
                          90       100       110
                  ....*....|....*....|....*....|....*..
gi 75200774   253 RKLFYEMQTSGNYPDSVAHNALLDGFCKLGRMVEAFE 289
Cdd:pfam17177 110 FDLVKEMEAAGVSPRLRSYSPALHAYCEAGDADKAYE 146
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
234-268 9.61e-06

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 42.83  E-value: 9.61e-06
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 75200774   234 VTYTILISGLCQRGSADDARKLFYEMQTSGNYPDS 268
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
550-575 1.25e-05

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 43.12  E-value: 1.25e-05
                          10        20
                  ....*....|....*....|....*.
gi 75200774   550 PDSVTYNTLINGLHRVGREEEAFKLF 575
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLF 26
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
409-443 1.26e-05

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 42.44  E-value: 1.26e-05
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 75200774   409 CTHTILICSMCRNGLVREAEEIFTEIEKSGCSPSV 443
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
266-292 1.73e-05

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 41.95  E-value: 1.73e-05
                          10        20
                  ....*....|....*....|....*..
gi 75200774   266 PDSVAHNALLDGFCKLGRMVEAFELLR 292
Cdd:pfam12854   5 PDVVTYNTLINGLCRAGRVDEAFELLD 31
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
234-263 1.83e-05

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 42.07  E-value: 1.83e-05
                          10        20        30
                  ....*....|....*....|....*....|
gi 75200774   234 VTYTILISGLCQRGSADDARKLFYEMQTSG 263
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKG 30
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
306-338 2.75e-05

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 41.67  E-value: 2.75e-05
                          10        20        30
                  ....*....|....*....|....*....|...
gi 75200774   306 YSSLIDGLFRARRYTQAFELYANMLKKNIKPDI 338
Cdd:TIGR00756   3 YNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
518-548 9.25e-05

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 40.14  E-value: 9.25e-05
                          10        20        30
                  ....*....|....*....|....*....|.
gi 75200774   518 VSYNVLINGFCRAGDIDGALKLLNVLQLKGL 548
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
339-369 1.45e-04

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 39.37  E-value: 1.45e-04
                          10        20        30
                  ....*....|....*....|....*....|.
gi 75200774   339 ILYTILIQGLSKAGKIEDALKLLSSMPSKGI 369
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
512-560 1.68e-04

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 40.42  E-value: 1.68e-04
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 75200774   512 GSSPDIVSYNVLINGFCRAGDIDGALKLLNVLQLKGLSPDSVTYNTLIN 560
Cdd:pfam13812  10 GIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILG 58
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
219-275 1.82e-04

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 40.03  E-value: 1.82e-04
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 75200774   219 KMFDDMTGRGISPNRVTYTILISGLCQRGSADDARKLFYEMQTSGNYPDSVAHNALL 275
Cdd:pfam13812   1 SILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAIL 57
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
406-454 2.20e-04

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 40.03  E-value: 2.20e-04
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 75200774   406 PDACTHTILICSMCRNGLVREAEEIFTEIEKSGCSPSVATFNALIdGLC 454
Cdd:pfam13812  13 LNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAIL-GVI 60
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
445-470 5.36e-04

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 37.83  E-value: 5.36e-04
                          10        20
                  ....*....|....*....|....*.
gi 75200774   445 TFNALIDGLCKSGELKEARLLLHKME 470
Cdd:pfam01535   2 TYNSLISGYCKNGKLEEALELFKEMK 27
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
128-162 8.14e-04

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 37.43  E-value: 8.14e-04
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 75200774   128 YCFCVLISAYAKMGMAEKAVESFGRMKEFDCRPDV 162
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
445-470 8.22e-04

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 37.43  E-value: 8.22e-04
                          10        20
                  ....*....|....*....|....*.
gi 75200774   445 TFNALIDGLCKSGELKEARLLLHKME 470
Cdd:TIGR00756   2 TYNTLIDGLCKAGRVEEALELFKEMK 27
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
410-439 8.94e-04

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 37.06  E-value: 8.94e-04
                          10        20        30
                  ....*....|....*....|....*....|
gi 75200774   410 THTILICSMCRNGLVREAEEIFTEIEKSGC 439
Cdd:pfam01535   2 TYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PLN03081 PLN03081
pentatricopeptide (PPR) repeat-containing protein; Provisional
432-575 9.27e-04

pentatricopeptide (PPR) repeat-containing protein; Provisional


Pssm-ID: 215563 [Multi-domain]  Cd Length: 697  Bit Score: 42.55  E-value: 9.27e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 75200774  432 TEIEKSGcspsvATFNALIDGLCKSGELKEARLLLHKMEVGRPASLflrlshsGNRSFDTMVESGSILKAYRDLA----H 507
Cdd:PLN03081  81 TQIRKSG-----VSLCSQIEKLVACGRHREALELFEILEAGCPFTL-------PASTYDALVEACIALKSIRCVKavywH 148
                         90       100       110       120       130       140
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 75200774  508 FADTGSSPDIVSYNVLINGFCRAGDIDGALKLLNVLQLKGLspdsVTYNTLINGLHRVGREEEAFKLF 575
Cdd:PLN03081 149 VESSGFEPDQYMMNRVLLMHVKCGMLIDARRLFDEMPERNL----ASWGTIIGGLVDAGNYREAFALF 212
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
290-345 1.13e-03

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 37.72  E-value: 1.13e-03
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 75200774   290 LLRLFEKDGFVLGLRGYSSLIDGLFRARRYTQAFELYANMLKKNIKPDIILYTILI 345
Cdd:pfam13812   2 ILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAIL 57
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
269-301 1.81e-03

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 36.28  E-value: 1.81e-03
                          10        20        30
                  ....*....|....*....|....*....|...
gi 75200774   269 VAHNALLDGFCKLGRMVEAFELLRLFEKDGFVL 301
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEP 33
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
269-299 1.87e-03

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 36.29  E-value: 1.87e-03
                          10        20        30
                  ....*....|....*....|....*....|.
gi 75200774   269 VAHNALLDGFCKLGRMVEAFELLRLFEKDGF 299
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
128-158 2.61e-03

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 35.90  E-value: 2.61e-03
                          10        20        30
                  ....*....|....*....|....*....|.
gi 75200774   128 YCFCVLISAYAKMGMAEKAVESFGRMKEFDC 158
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
116-250 3.07e-03

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 39.69  E-value: 3.07e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 75200774   116 EELKSGGVSVDSYCFCVLI---------SAYAKMGMAEKAVESFGRMKEFDCRPDVFTYNVILRVMMREEVFfMLAFAVY 186
Cdd:pfam17177  35 DAAKAEGVRLAQYHYNVLLylcskaadaTDLKPQLAADRGFEVFEAMKAQGVSPNEATYTAVARLAAAKGDG-DLAFDLV 113
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 75200774   187 NEMLKCNCSPNLYTFGILMDGLYKKGRTSDAQKMFDDMTGRGISPNRVTYTILISGLCQRGSAD 250
Cdd:pfam17177 114 KEMEAAGVSPRLRSYSPALHAYCEAGDADKAYEVEEHMLAHGVELEEPELAALLKVSAKAGRAD 177
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
199-232 3.82e-03

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 35.51  E-value: 3.82e-03
                          10        20        30
                  ....*....|....*....|....*....|....
gi 75200774   199 YTFGILMDGLYKKGRTSDAQKMFDDMTGRGISPN 232
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPD 34
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
367-391 6.35e-03

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 35.01  E-value: 6.35e-03
                          10        20
                  ....*....|....*....|....*
gi 75200774   367 KGISPDTYCYNAVIKALCGRGLLEE 391
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDE 25
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
553-575 7.86e-03

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 34.75  E-value: 7.86e-03
                          10        20
                  ....*....|....*....|...
gi 75200774   553 VTYNTLINGLHRVGREEEAFKLF 575
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELF 23
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
199-229 7.94e-03

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 34.75  E-value: 7.94e-03
                          10        20        30
                  ....*....|....*....|....*....|.
gi 75200774   199 YTFGILMDGLYKKGRTSDAQKMFDDMTGRGI 229
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH