22 research outputs found

    OrgConv: detection of gene conversion using consensus sequences and its application in plant mitochondrial and chloroplast homologs

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The ancestry of mitochondria and chloroplasts traces back to separate endosymbioses of once free-living bacteria. The highly reduced genomes of these two organelles therefore contain very distant homologs that only recently have been shown to recombine inside the mitochondrial genome. Detection of gene conversion between mitochondrial and chloroplast homologs was previously impossible due to the lack of suitable computer programs. Recently, I developed a novel method and have, for the first time, discovered recurrent gene conversion between chloroplast mitochondrial genes. The method will further our understanding of plant organellar genome evolution and help identify and remove gene regions with incongruent phylogenetic signals for several genes widely used in plant systematics. Here, I implement such a method that is available in a user friendly web interface.</p> <p>Results</p> <p><monospace>OrgConv</monospace> (<b>Org</b>anellar <b>Conv</b>ersion) is a computer package developed for detection of gene conversion between mitochondrial and chloroplast homologous genes. <monospace>OrgConv</monospace> is available in two forms; source code can be installed and run on a Linux platform and a web interface is available on multiple operating systems. The input files of the feature program are two multiple sequence alignments from different organellar compartments in FASTA format. The program compares every examined sequence against the consensus sequence of each sequence alignment rather than exhaustively examining every possible combination. Making use of consensus sequences significantly reduces the number of comparisons and therefore reduces overall computational time, which allows for analysis of very large datasets. Most importantly, with the significantly reduced number of comparisons, the statistical power remains high in the face of correction for multiple tests.</p> <p>Conclusions</p> <p>Both the source code and the web interface of <monospace>OrgConv</monospace> are available for free from the <monospace>OrgConv</monospace> website <url>http://www.indiana.edu/~orgconv</url>. Although <monospace>OrgConv</monospace> has been developed with main focus on detection of gene conversion between mitochondrial and chloroplast genes, it may also be used for detection of gene conversion between any two distinct groups of homologous sequences.</p

    Maternal colonization with Streptococcus agalactiae and associated stillbirth and neonatal disease in coastal Kenya

    Get PDF
    Streptococcus agalactiae (group B streptococcus, GBS) causes neonatal disease and stillbirth, but its burden in sub-Saharan Africa is uncertain. We assessed maternal recto-vaginal GBS colonization (7,967 women), stillbirth and neonatal disease. Whole-genome sequencing was used to determine serotypes, sequence types and phylogeny. We found low maternal GBS colonization prevalence (934/7,967, 12%), but comparatively high incidence of GBS-associated stillbirth and early onset neonatal disease (EOD) in hospital (0.91 (0.25-2.3)/1,000 births and 0.76 (0.25-1.77)/1,000 live births, respectively). However, using a population denominator, EOD incidence was considerably reduced (0.13 (0.07-0.21)/1,000 live births). Treated cases of EOD had very high case fatality (17/36, 47%), especially within 24 h of birth, making under-ascertainment of community-born cases highly likely, both here and in similar facility-based studies. Maternal GBS colonization was less common in women with low socio-economic status, HIV infection and undernutrition, but when GBS-colonized, they were more probably colonized by the most virulent clone, CC17. CC17 accounted for 267/915 (29%) of maternal colonizing (265/267 (99%) serotype III; 2/267 (0.7%) serotype IV) and 51/73 (70%) of neonatal disease cases (all serotype III). Trivalent (Ia/II/III) and pentavalent (Ia/Ib/II/III/V) vaccines would cover 71/73 (97%) and 72/73 (99%) of disease-causing serotypes, respectively. Serotype IV should be considered for inclusion, with evidence of capsular switching in CC17 strains

    Population Genetics of Streptococcus dysgalactiae Subspecies equisimilis Reveals Widely Dispersed Clones and Extensive Recombination

    Get PDF
    Streptococcus dysgalactiae subspecies equisimilis (SDSE) is an emerging global pathogen that can colonize and infect humans. Although most SDSE isolates possess the Lancefield group G carbohydrate, a significant minority have the group C carbohydrate. Isolates are further sub-typed on the basis of differences within the emm gene. To gain a better understanding of their molecular epidemiology and evolutionary relationships, multilocus sequence typing (MLST) analysis was performed on SDSE isolates collected from Australia, Europe and North America.The 178 SDSE isolates, representing 37 emm types, segregate into 80 distinct sequence types (STs) that form 17 clonal complexes (CCs). Eight STs recovered from all three continents account for >50% of the isolates. Thus, a small number of STs are highly prevalent and have a wide geographic distribution. Both ST and CC strongly correlate with group carbohydrate. In contrast, eleven STs were associated with >1 emm type, suggestive of recombinational replacements involving the emm gene; furthermore, 35% of the emm types are associated with genetically distant STs. Data also reveal a history of extensive inter- and intra-species recombination involving the housekeeping genes used for MLST. Sequence analysis of single locus variants identified through goeBURST indicates that genetic change mediated by recombination occurred approximately 4.4 times more frequently than by point mutation.A few genetic lineages with an intercontinental distribution dominate among SDSE causing infections in humans. The distinction between group C and G isolates reflects recent evolution, and no long-term genetic isolation between them was found. Lateral gene transfer and recombination involving housekeeping genes and the emm gene are important mechanisms driving genetic variability in the SDSE population

    Monitoring the Long-Term Molecular Epidemiology of the Pneumococcus and Detection of Potential ‘Vaccine Escape’ Strains

    Get PDF
    While the pneumococcal protein conjugate vaccines reduce the incidence in invasive pneumococcal disease (IPD), serotype replacement remains a major concern. Thus, serotype-independent protection with vaccines targeting virulence genes, such as PspA, have been pursued. PspA is comprised of diverse clades that arose through recombination. Therefore, multi-locus sequence typing (MLST)-defined clones could conceivably include strains from multiple PspA clades. As a result, a method is needed which can both monitor the long-term epidemiology of the pneumococcus among a large number of isolates, and analyze vaccine-candidate genes, such as pspA, for mutations and recombination events that could result in 'vaccine escape' strains.We developed a resequencing array consisting of five conserved and six variable genes to characterize 72 pneumococcal strains. The phylogenetic analysis of the 11 concatenated genes was performed with the MrBayes program, the single nucleotide polymorphism (SNP) analysis with the DNA Sequence Polymorphism program (DnaSP), and the recombination event analysis with the recombination detection package (RDP).The phylogenetic analysis correlated with MLST, and identified clonal strains with unique PspA clades. The DnaSP analysis correlated with the serotype-specific diversity detected using MLST. Serotypes associated with more than one ST complex had a larger degree of sequence polymorphism than a serotype associated with one ST complex. The RDP analysis confirmed the high frequency of recombination events in the pspA gene.The phylogenetic tree correlated with MLST, and detected multiple PspA clades among clonal strains. The genetic diversity of the strains and the frequency of recombination events in the mosaic gene, pspA were accurately assessed using the DnaSP and RDP programs, respectively. These data provide proof-of-concept that resequencing arrays could play an important role within research and clinical laboratories in both monitoring the molecular epidemiology of the pneumococcus and detecting 'vaccine escape' strains among vaccine-candidate genes
    corecore