Identification, distribution, and expression of novel genes in 10 clinical isolates of nontypeable Haemophilus influenzae

Infect Immun. 2005 Jun;73(6):3479-91. doi: 10.1128/IAI.73.6.3479-3491.2005.

Abstract

We hypothesize that Haemophilus influenzae, as a species, possesses a much greater number of genes than that found in any single H. influenzae genome. This supragenome is distributed throughout naturally occurring infectious populations, and new strains arise through autocompetence and autotransformation systems. The effect is that H. influenzae populations can readily adapt to environmental stressors. The supragenome hypothesis predicts that significant differences exist between and among the genomes of individual infectious strains of nontypeable H. influenzae (NTHi). To test this prediction, we obtained 10 low-passage NTHi clinical isolates from the middle ear effusions of patients with chronic otitis media. DNA sequencing was performed with 771 clones chosen at random from a pooled genomic library. Homology searching demonstrated that approximately 10% of these clones were novel compared to the H. influenzae Rd KW20 genome, and most of them did not match any DNA sequence in GenBank. Amino acid homology searches using hypothetical translations of the open reading frames revealed homologies to a variety of proteins, including bacterial virulence factors not previously identified in the NTHi isolates. The distribution and expression of 53 of these genes among the 10 strains were determined by PCR- and reverse transcription PCR-based analyses. These unique genes were nonuniformly distributed among the 10 isolates, and transcription of these genes in planktonic cultures was detected in 50% (177 of 352) of the occurrences. All of the novel sequences were transcribed in one or more of the NTHi isolates. Seventeen percent (9 of 53) of the novel genes were identified in all 10 NTHi strains, with each of the remaining 44 being present in only a subset of the strains. These genic distribution analyses were more effective as a strain discrimination tool than either multilocus sequence typing or 23S ribosomal gene typing methods.
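The genic distribution analysis described above can be illustrated with a minimal sketch. It assumes that PCR results are recorded as a gene-by-strain presence/absence table. The strain labels, gene names, and table values below are hypothetical and are not data from the study; only the counts passed to `percent` in the final two lines (9 of 53 and 177 of 352) are taken from the abstract. The sketch separates genes found in every strain from those found in only a subset, then counts how many genes differ between each pair of strains, which is one simple way a presence/absence profile could discriminate strains.

```python
from itertools import combinations

# Hypothetical toy data: one row per gene, one column per strain
# (1 = PCR-positive, 0 = PCR-negative). Invented for illustration only.
STRAINS = ["S1", "S2", "S3", "S4"]
PRESENCE = {
    "geneA": [1, 1, 1, 1],
    "geneB": [1, 0, 1, 0],
    "geneC": [0, 0, 1, 1],
    "geneD": [1, 1, 0, 0],
    "geneE": [0, 1, 0, 1],
}


def split_core_and_distributed(presence):
    """Return (genes present in every strain, genes present in only a subset)."""
    core = sorted(g for g, row in presence.items() if all(row))
    distributed = sorted(g for g, row in presence.items() if not all(row))
    return core, distributed


def strain_profiles(presence, strains):
    """Build one presence/absence tuple per strain, with genes in sorted order."""
    genes = sorted(presence)
    return {s: tuple(presence[g][i] for g in genes) for i, s in enumerate(strains)}


def pairwise_differences(profiles):
    """Count the genes whose presence differs between each pair of strains."""
    return {
        (a, b): sum(x != y for x, y in zip(profiles[a], profiles[b]))
        for a, b in combinations(sorted(profiles), 2)
    }


def percent(part, whole):
    """Percentage rounded to the nearest whole number."""
    return round(100 * part / whole)


if __name__ == "__main__":
    core, distributed = split_core_and_distributed(PRESENCE)
    profiles = strain_profiles(PRESENCE, STRAINS)
    print("core:", core)
    print("distributed:", distributed)
    print("pairwise differences:", pairwise_differences(profiles))
    # Counts quoted in the abstract.
    print("genes in all strains (%):", percent(9, 53))
    print("occurrences transcribed (%):", percent(177, 352))
```

In this toy table every strain has a distinct profile, so presence/absence alone tells all four apart. Whether that holds for real isolates depends on how many distributed genes are assayed.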

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Base Sequence
  • DNA, Bacterial / chemistry
  • Genome, Bacterial
  • Genomic Islands
  • Haemophilus influenzae / classification
  • Haemophilus influenzae / genetics*
  • Haemophilus influenzae / pathogenicity
  • Humans
  • Molecular Sequence Data
  • Phylogeny
  • RNA, Ribosomal, 23S / genetics
  • Repetitive Sequences, Amino Acid
  • Virulence

Substances

  • DNA, Bacterial
  • RNA, Ribosomal, 23S

Associated data

  • GENBANK/AY599423
  • GENBANK/AY599424
  • GENBANK/AY599425
  • GENBANK/AY599426
  • GENBANK/AY599427
  • GENBANK/AY599428
  • GENBANK/AY599429
  • GENBANK/AY599430
  • GENBANK/AY599431
  • GENBANK/AY599432
  • GENBANK/AY599433
  • GENBANK/AY599434
  • GENBANK/AY599435
  • GENBANK/AY599436
  • GENBANK/AY599437
  • GENBANK/AY599438
  • GENBANK/AY599439
  • GENBANK/AY599440
  • GENBANK/AY599441
  • GENBANK/AY599442
  • GENBANK/AY599443
  • GENBANK/AY599444
  • GENBANK/AY599445
  • GENBANK/AY599446
  • GENBANK/AY599447
  • GENBANK/AY599448
  • GENBANK/AY599449
  • GENBANK/AY599450
  • GENBANK/AY599451
  • GENBANK/AY599452
  • GENBANK/AY599453
  • GENBANK/AY599454
  • GENBANK/AY599455
  • GENBANK/AY599456
  • GENBANK/AY599457
  • GENBANK/AY599458
  • GENBANK/AY599459
  • GENBANK/AY599460
  • GENBANK/AY599461
  • GENBANK/AY599462
  • GENBANK/AY599463
  • GENBANK/AY599464
  • GENBANK/AY599465
  • GENBANK/AY599466
  • GENBANK/AY599467
  • GENBANK/AY599468
  • GENBANK/AY599469
  • GENBANK/AY599470
  • GENBANK/AY599471
  • GENBANK/AY599472
  • GENBANK/AY599473
  • GENBANK/AY599474
  • GENBANK/AY599475
  • GENBANK/AY599476
  • GENBANK/AY599477
  • GENBANK/AY599478
  • GENBANK/AY599479
  • GENBANK/AY599480
  • GENBANK/AY599481
  • GENBANK/AY599482
  • GENBANK/AY599483
  • GENBANK/AY599484
  • GENBANK/AY599485
  • GENBANK/AY599486