Article thumbnail
Location of Repository

Gain and Loss of Multiple Genes During the Evolution of Helicobacter pylori

By Helga Gressmann, Bodo Linz, Rohit Ghai, Klaus-Peter Pleissner, Ralph Schlapbach, Yoshio Yamaoka, Christian Kraft, Sebastian Suerbaum, Thomas F Meyer and Mark Achtman


Sequence diversity and gene content distinguish most isolates of Helicobacter pylori. Even greater sequence differences differentiate distinct populations of H. pylori from different continents, but it was not clear whether these populations also differ in gene content. To address this question, we tested 56 globally representative strains of H. pylori and four strains of Helicobacter acinonychis with whole genome microarrays. Of the weighted average of 1,531 genes present in the two sequenced genomes, 25% are absent in at least one strain of H. pylori and 21% were absent or variable in H. acinonychis. We extrapolate that the core genome present in all isolates of H. pylori contains 1,111 genes. Variable genes tend to be small and possess unusual GC content; many of them have probably been imported by horizontal gene transfer. Phylogenetic trees based on the microarray data differ from those based on sequences of seven genes from the core genome. These discrepancies are due to homoplasies resulting from independent gene loss by deletion or recombination in multiple strains, which distort phylogenetic patterns. The patterns of these discrepancies versus population structure allow a reconstruction of the timing of the acquisition of variable genes within this species. Variable genes that are located within the cag pathogenicity island were apparently first acquired en bloc after speciation. In contrast, most other variable genes are of unknown function or encode restriction/modification enzymes, transposases, or outer membrane proteins. These seem to have been acquired prior to speciation of H. pylori and were subsequently lost by convergent evolution within individual strains. Thus, the use of microarrays can reveal patterns of gene gain or loss when examined within a phylogenetic context that is based on sequences of core genes

Topics: Research Article
Publisher: Public Library of Science
Year: 2005
DOI identifier: 10.1371/journal.pgen.0010043
OAI identifier:
Provided by: PubMed Central

Suggested articles


  1. (2002). A phylogenetic perspective on molecular epidemiology.
  2. (2003). A revised annotation and comparative analysis of Helicobacter pylori genomes.
  3. (2005). ACT: The artemis comparison tool.
  4. (1997). Amelioration of bacterial genomes: Rates of change and exchange.
  5. (2003). Application of DNA microarrays to study the evolutionary genomics of Yersinia pestis and Yersinia pseudotuberculosis.
  6. (1996). cag, a pathogenicity island of Helicobacter pylori, encodes type-I specific and disease-associated virulence factors.
  7. (1999). cagA gene and vacA alleles in Spanish Helicobacter pylori clinical isolates from patients of different ages.
  8. (1998). cagA-positive Helicobacter pylori populations in China and The Netherlands are distinct.
  9. (2004). Characterization of Salmonella enterica subspecies I genovars by use of microarrays.
  10. (1998). Chronic gastritis in tigers associated with Helicobacter acinonyx.
  11. (2003). Complete genome sequence and analysis of Wolinella succinogenes.
  12. (2000). Covacci A
  13. (1992). DNA diversity among clinical isolates of Helicobacter pylori detected by PCR-based RAPD fingerprinting.
  14. (2004). DNA microarray analysis of genome dynamics in Yersinia pestis: Insights into bacterial genome microevolution and niche adaptation.
  15. (2001). Emergence of diverse Helicobacter species in the pathogenesis of gastric and enterohepatic diseases.
  16. (1999). Emergence of recombinant strains of Helicobacter pylori during human infection.
  17. (1998). Empirical statistical estimates for sequence similarity searches.
  18. (2005). Evolutionary origins of genomic repertoires in bacteria.
  19. (2000). Flexible sequence similarity searching with the FASTA3 program package.
  20. (1998). Free recombination within Helicobacter pylori.
  21. (2004). Functional adaptation of BabA, the H. pylori ABO blood group antigen binding adhesin.
  22. (2004). Functional and evolutionary genomics of Mycobacterium tuberculosis: Insights from genomic deletions in 100 strains.
  23. (2004). Genome-wide analysis of transcriptional hierarchy and feedback regulation in the flagellar system of Helicobacter pylori.
  24. (2004). GenomeViz: Visualizing microbial genomes.
  25. (1999). Genomicsequence comparison of two unrelated isolates of the human gastric pathogen Helicobacter pylori.
  26. (1999). Geographic distribution of vacA allelic types of Helicobacter pylori.
  27. (2004). Helicobacter acinonychis: Genetic and rodent infection studies of a Helicobacter pylori-like gastric pathogen of cheetahs and other big cats.
  28. (1993). Helicobacter acinonyx sp. nov., isolated from cheetahs with gastritis.
  29. (2002). Helicobacter nemestrinae ATCC 49396 is a strain of Helicobacter pylori
  30. (2001). Helicobacter pylori genetic diversity within the gastric niche of a single human host.
  31. (2002). Helicobacter pylori infection.
  32. (2002). Helicobacter pylori strain and the pattern of gastritis among first-degree relatives of patients with gastric carcinoma.
  33. (1999). Helicobacter pylori virulence and genetic geography.
  34. (1993). How clonal are bacteria?
  35. (2002). Improved analytical methods for microarray-based genome-composition analysis.
  36. (2001). MEGA2: Molecular evolutionary genetics analysis software.
  37. (2004). Metastability of Helicobacter pylori bab adhesin genes and dynamics in Lewis b antigen binding.
  38. (2001). microarray reveals genetic diversity among Helicobacter pylori strains.
  39. (2004). Microevolution and history of the plague bacillus, Yersinia pestis.
  40. (2003). Multi-locus sequence typing: A tool for global epidemiology.
  41. (2004). New aspects regarding evolution and virulence of Listeria monocytogenes revealed by comparative genomics and DNA arrays.
  42. (1992). PCR-based RFLP analysis of DNA sequence diversity in the gastric pathogen Helicobacter pylori.
  43. (2004). Phylogenetic discovery bias in Bacillus anthracis using single-nucleotide polymorphisms from whole-genome sequencing.
  44. (1996). Population genetic analysis of Helicobacter pylori by multilocus enzyme electrophoresis: Extensive allelic diversity and recombinational population structure.
  45. (2003). Presence of active aliphatic amidases in Helicobacter species able to colonize the stomach.
  46. (2001). PrimeArray: Genome-scale primer design for DNA-microarray construction.
  47. (2000). Quasispecies development of Helicobacter pylori observed in paired isolates obtained years apart from the same host.
  48. (1996). R: A language for data analysis and graphics.
  49. (1999). Recombination and clonal groupings within Helicobacter pylori from different geographical regions.
  50. (2001). Recombination and mutation during long-term gastric colonization by Helicobacter pylori: Estimates of clock rates, recombination size, and minimal age.
  51. (2001). Recombination within natural populations of pathogenic bacteria: Shortterm empirical estimates and long-term phylogenetic comparisons.
  52. (1997). Restricted structural gene polymorphism in the Mycobacterium tuberculosis complex indicates evolutionarily recent global dissemination.
  53. (1998). Restriction-modification gene complexes as selfish gene entities: Roles of a regulatory system in their establishment, maintenance, and apoptotic mutual exclusion.
  54. (2004). Stable association between strains of Mycobacterium tuberculosis and their human host populations.
  55. (2003). The complete genome sequence of the carcinogenic bacterium Helicobacter hepaticus.
  56. (1997). The complete genome sequence of the gastric pathogen Helicobacter pylori.
  57. (2004). The diversity within an expanded and redefined repertoire of phase-variable genes in Helicobacter pylori.
  58. (2003). Traces of human migrations in Helicobacter pylori populations.
  59. (2000). Translocation of Helicobacter pylori CagA into gastric epithelial cells by type IV secretion.
  60. (2000). Translocation of the Helicobacter pylori CagA protein in gastric epithelial cells by a type IV secretion apparatus.
  61. (1996). Variability of gene order in different Helicobacter pylori strains contributes to genome diversity.

To submit an update or takedown request for this paper, please submit an Update/Correction/Removal Request.