24 research outputs found

    FAAST: Flow-space Assisted Alignment Search Tool

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>High throughput pyrosequencing (454 sequencing) is the major sequencing platform for producing long read high throughput data. While most other sequencing techniques produce reading errors mainly comparable with substitutions, pyrosequencing produce errors mainly comparable with gaps. These errors are less efficiently detected by most conventional alignment programs and may produce inaccurate alignments.</p> <p>Results</p> <p>We suggest a novel algorithm for calculating the optimal local alignment which utilises flowpeak information in order to improve alignment accuracy. Flowpeak information can be retained from a 454 sequencing run through interpretation of the binary SFF-file format. This novel algorithm has been implemented in a program named FAAST (Flow-space Assisted Alignment Search Tool).</p> <p>Conclusions</p> <p>We present and discuss the results of simulations that show that FAAST, through the use of the novel algorithm, can gain several percentage points of accuracy compared to Smith-Waterman-Gotoh alignments, depending on the 454 data quality. Furthermore, through an efficient multi-thread aware implementation, FAAST is able to perform these high quality alignments at high speed.</p> <p>The tool is available at <url>http://www.ifm.liu.se/bioinfo/</url></p

    FAAST: Flow-space Assisted Alignment Search Tool

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>High throughput pyrosequencing (454 sequencing) is the major sequencing platform for producing long read high throughput data. While most other sequencing techniques produce reading errors mainly comparable with substitutions, pyrosequencing produce errors mainly comparable with gaps. These errors are less efficiently detected by most conventional alignment programs and may produce inaccurate alignments.</p> <p>Results</p> <p>We suggest a novel algorithm for calculating the optimal local alignment which utilises flowpeak information in order to improve alignment accuracy. Flowpeak information can be retained from a 454 sequencing run through interpretation of the binary SFF-file format. This novel algorithm has been implemented in a program named FAAST (Flow-space Assisted Alignment Search Tool).</p> <p>Conclusions</p> <p>We present and discuss the results of simulations that show that FAAST, through the use of the novel algorithm, can gain several percentage points of accuracy compared to Smith-Waterman-Gotoh alignments, depending on the 454 data quality. Furthermore, through an efficient multi-thread aware implementation, FAAST is able to perform these high quality alignments at high speed.</p> <p>The tool is available at <url>http://www.ifm.liu.se/bioinfo/</url></p

    Genome analysis and comparative genomics of a Giardia intestinalis assemblage E isolate

    Get PDF
    <p>Abstract</p> <p>Background</p> <p><it>Giardia intestinalis </it>is a protozoan parasite that causes diarrhea in a wide range of mammalian species. To further understand the genetic diversity between the <it>Giardia intestinalis </it>species, we have performed genome sequencing and analysis of a wild-type <it>Giardia intestinalis </it>sample from the assemblage E group, isolated from a pig.</p> <p>Results</p> <p>We identified 5012 protein coding genes, the majority of which are conserved compared to the previously sequenced genomes of the WB and GS strains in terms of microsynteny and sequence identity. Despite this, there is an unexpectedly large number of chromosomal rearrangements and several smaller structural changes that are present in all chromosomes. Novel members of the VSP, NEK Kinase and HCMP gene families were identified, which may reveal possible mechanisms for host specificity and new avenues for antigenic variation. We used comparative genomics of the three diverse <it>Giardia intestinalis </it>isolates P15, GS and WB to define a core proteome for this species complex and to identify lineage-specific genes. Extensive analyses of polymorphisms in the core proteome of <it>Giardia </it>revealed differential rates of divergence among cellular processes.</p> <p>Conclusions</p> <p>Our results indicate that despite a well conserved core of genes there is significant genome variation between <it>Giardia </it>isolates, both in terms of gene content, gene polymorphisms, structural chromosomal variations and surface molecule repertoires. This study improves the annotation of the <it>Giardia </it>genomes and enables the identification of functionally important variation.</p

    Draft Genome Sequencing of Giardia intestinalis Assemblage B Isolate GS: Is Human Giardiasis Caused by Two Different Species?

    Get PDF
    Giardia intestinalis is a major cause of diarrheal disease worldwide and two major Giardia genotypes, assemblages A and B, infect humans. The genome of assemblage A parasite WB was recently sequenced, and the structurally compact 11.7 Mbp genome contains simplified basic cellular machineries and metabolism. We here performed 454 sequencing to 16× coverage of the assemblage B isolate GS, the only Giardia isolate successfully used to experimentally infect animals and humans. The two genomes show 77% nucleotide and 78% amino-acid identity in protein coding regions. Comparative analysis identified 28 unique GS and 3 unique WB protein coding genes, and the variable surface protein (VSP) repertoires of the two isolates are completely different. The promoters of several enzymes involved in the synthesis of the cyst-wall lack binding sites for encystation-specific transcription factors in GS. Several synteny-breaks were detected and verified. The tetraploid GS genome shows higher levels of overall allelic sequence polymorphism (0.5 versus <0.01% in WB). The genomic differences between WB and GS may explain some of the observed biological and clinical differences between the two isolates, and it suggests that assemblage A and B Giardia can be two different species

    Oxygen induces the expression of invasion and stress response genes in the anaerobic salmon parasite Spironucleus salmonicida

    No full text
    Background: Spironucleus salmonicida is an anaerobic parasite that can cause systemic infections in Atlantic salmon. Unlike other diplomonad parasites, such as the human pathogen Giardia intestinalis, Spironucleus species can infiltrate the blood stream of their hosts eventually colonizing organs, skin and gills. How this presumed anaerobe can persist and invade oxygenated tissues, despite having a strictly anaerobic metabolism, remains elusive. Results: To investigate how S. salmonicida response to oxygen stress, we performed RNAseq transcriptomic analyses of cells grown in the presence of oxygen or antioxidant-free medium. We found that over 20% of the transcriptome is differentially regulated in oxygen (1705 genes) and antioxidant-depleted (2280 genes) conditions. These differentially regulated transcripts encode proteins related to anaerobic metabolism, cysteine and Fe-S cluster biosynthesis, as well as a large number of proteins of unknown function. S. salmonicida does not encode genes involved in the classical elements of oxygen metabolism (e.g., catalases, superoxide dismutase, glutathione biosynthesis, oxidative phosphorylation). Instead, we found that genes encoding bacterial-like oxidoreductases were upregulated in response to oxygen stress. Phylogenetic analysis revealed some of these oxygen-responsive genes (e.g., nadh oxidase, rubrerythrin, superoxide reductase) are rare in eukaryotes and likely derived from lateral gene transfer (LGT) events into diplomonads from prokaryotes. Unexpectedly, we observed that many host evasion- and invasion-related genes were also upregulated under oxidative stress suggesting that oxygen might be an important signal for pathogenesis. Conclusion: While oxygen is toxic for related organisms, such as G. intestinalis, we find that oxygen is likely a gene induction signal for host invasion- and evasion-related pathways in S. salmonicida. These data provide the first molecular evidence for how S. salmonicida could tolerate oxic host environments and demonstrate how LGT can have a profound impact on the biology of anaerobic parasites

    Draft genomes of Blastocystis subtypes from human samples of Colombia.

    No full text
    Funder: Universidad del RosarioBACKGROUND: Blastocystis is one of the most common eukaryotic microorganisms colonizing the intestines of both humans and animals, but the conditions under which it may be a pathogen are unclear. METHODS: To study the genomic characteristics of circulating subtypes (ST) in Colombia, we established nine xenic cultures from Blastocystis isolated from human fecal samples, we identified 10 different subtypes, since one sample had a mixed infection. Thus, the genomes of the subtypes ST1 (n = 3), ST2 (n = 1), ST3 (n = 2), ST6 (n = 1), ST7 (n = 1), and ST8 (n = 2) were sequenced using Illumina and Oxford Nanopore Technologies (ONT). RESULTS: Analyses of these draft nuclear genomes indicated remarkable diversity in terms of genome size and guanine-cytosine (GC) content among the compared STs. Illumina sequencing-only draft genomes contained 824 to 2077 scaffolds, with total genome size ranging from 12 to 13.2 Mb and N50 values ranging from 10,585 to 29,404 base pairs (bp). The genome of one ST1 isolate was sequenced using ONT. This assembly was more contiguous, with a size of 20 million base pairs (Mb) spread over 116 scaffolds, and an N50 of 248,997 bp. CONCLUSION: This work represents one of the few large-scale comparative genomic analyses of Blastocystis isolates, providing an additional glimpse into its genomic diversity

    Anaeramoebae are a divergent lineage of eukaryotes that shed light on the transition from anaerobic mitochondria to hydrogenosomes

    No full text
    Discoveries of diverse microbial eukaryotes and their inclusion in comprehensive phylogenomic analyses have crucially re-shaped the eukaryotic tree of life in the 21st century.1 At the deepest level, eukaryotic diversity comprises 9-10 "supergroups." One of these supergroups, the Metamonada, is particularly important to our understanding of the evolutionary dynamics of eukaryotic cells, including the remodeling of mitochondrial function. All metamonads thrive in low-oxygen environments and lack classical aerobic mitochondria, instead possessing mitochondrion-related organelles (MROs) with metabolisms that are adapted to low-oxygen conditions. These MROs lack an organellar genome, do not participate in the Krebs cycle and oxidative phosphorylation,2 and often synthesize ATP by substrate-level phosphorylation coupled to hydrogen production.3,4 The events that occurred during the transition from an oxygen-respiring mitochondrion to a functionally streamlined MRO early in metamonad evolution remain largely unknown. Here, we report transcriptomes of two recently described, enigmatic, anaerobic protists from the genus Anaeramoeba.5 Using phylogenomic analysis, we show that these species represent a divergent, phylum-level lineage in the tree of metamonads, emerging as a sister group of the Parabasalia and reordering the deep branching order of the metamonad tree. Metabolic reconstructions of the Anaeramoeba MROs reveal many "classical" mitochondrial features previously not seen in metamonads, including a disulfide relay import system, propionate production, and amino acid metabolism. Our findings suggest that the cenancestor of Metamonada likely had MROs with more classical mitochondrial features than previously anticipated and demonstrate how discoveries of novel lineages of high taxonomic rank continue to transform our understanding of early eukaryote evolution
    corecore