New genotypes of Helicobacter Pylori VacA d-region identified from global strains

BMC Mol Cell Biol. 2021 Jan 7;22(1):4. doi: 10.1186/s12860-020-00338-2.

Abstract

Background: Pathogenesis of Helicobacter Pylori (HP) vacuolating toxin A (vacA) depends on polymorphic diversity within the signal (s), middle (m), intermediate (i), deletion (d) and c-regions. These regions show distinct allelic diversity. The s-region, m-region and the c-region (a 15 bp deletion at the 3'-end region of the p55 domain of the vacA gene) exist as 2 types (s1, s2, m1, m2, c1 and c2), while the i-region has 3 allelic types (i1, i2 and i3). The locus of d-region of the vacA gene has also been classified into 2 genotypes, namely d1 and d2. We investigated the "d-region"/"loop region" through bioinformatics, to predict its properties and relation to disease. One thousand two hundred fifty-nine strains from the NCBI nucleotide database and the dryad database with complete vacA sequences were included in the study. The sequences were aligned using BioEdit and analyzed using Lasergene and BLAST. The secondary structure and physicochemical properties of the region were predicted using PredictProtein.

Results: We identified 31 highly polymorphic genotypes in the "d-region", with a mean length of 34 amino acids (9 ~ 55 amino acids). We further classified the 31 genotypes into 3 main types, namely K-type (strains starting with the KDKP motif in the "d-region"), Q-type (strains starting with the KNQT motif), and E-type (strains starting with the ESKT motif) respectively. The most common type, K-type, is more prevalent in cancer patients (80.87%) and is associated with the s1i1m1c1 genotypes (P < .01). Incidentally, a new region expressing sequence diversity (2 aa deletion) at the C-terminus of the p55 domain of vacA was identified during bioinformatics analysis.

Conclusions: Prediction of secondary structures shows that the "d-region" adopts a loop conformation and is a disordered region.

Keywords: Bioinformatics; Helicobacter pylori; Intrinsically disordered proteins; Polymorphism; Vacuolating toxin a.

MeSH terms

  • Amino Acid Motifs
  • Amino Acid Sequence
  • Bacterial Proteins / chemistry
  • Bacterial Proteins / genetics*
  • Genotype
  • Helicobacter pylori / genetics*
  • Humans
  • Prevalence
  • Protein Structure, Secondary
  • Solvents
  • Stomach Neoplasms / epidemiology
  • Stomach Neoplasms / microbiology

Substances

  • Bacterial Proteins
  • Solvents
  • VacA protein, Helicobacter pylori