Organization of the psbE, psbF, orf38, and orf42 gene loci on the Euglena gracilis chloroplast genome

Curr Genet. 1988 Feb;13(2):173-80. doi: 10.1007/BF00365652.

Abstract

The genes for cytochrome b559, designated psbE and psbF, and two highly conserved open reading frames of 38 and 42 codons have been located and characterized on the chloroplast genome of Euglena gracilis. The organization of the genes is psbE - 8 bp spacer - psbF - 110 bp spacer - orf38 - 87 bp spacer - orf42. All genes are of the same polarity. The psbE gene contains two introns of 350 and 326 bp. The psbF gene contains a single large intron of 1,042 bp. The orf38 and orf42 loci lack introns. The introns are extremely AT rich with a pronounced base composition bias of T greater than A greater than G greater than C in the mRNA-like strand and group II-like boundary sequences at their 3' and 5' ends having the consensus 5'-GTGTG .. INTRON .. TTAATTTNAT-3'. The psbE gene consists of 82 codons and encodes a polypeptide with a predicted molecular weight of 9,212. The psbF gene consists of 42 codons, which specify a polypeptide with a predicted molecular weight of 4,785. The highly conserved open reading frames of 38 and 42 codons code for polypeptides with predicted molecular weights of 4,405 and 4,426, respectively. The gene products of psbE, psbF, orf38 and orf42 are, respectively, 69.5%, 70% and 61.5% identical to those found in higher plants. The predicted secondary structure of the proteins from hydropathy plots is consistent with each containing a single membrane-spanning domain of at least 20 amino acids. Each of the genes is preceded by sequences which may serve as ribosome binding sites. All four genes are transcribed.

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Amino Acid Sequence
  • Animals
  • Base Sequence
  • Chloroplasts / metabolism*
  • Cytochrome b Group / genetics*
  • DNA, Ribosomal / genetics
  • Euglena gracilis / genetics*
  • Genes*
  • Molecular Sequence Data
  • Photosystem II Protein Complex*
  • Sequence Homology, Nucleic Acid
  • Species Specificity

Substances

  • Cytochrome b Group
  • DNA, Ribosomal
  • Photosystem II Protein Complex
  • cytochrome b559

Associated data

  • GENBANK/X07073