55 research outputs found

    Cutoffs and k-mers: implications from a transcriptome study in allopolyploid plants

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Transcriptome analysis is increasingly being used to study the evolutionary origins and ecology of non-model plants. One issue for both transcriptome assembly and differential gene expression analyses is the common occurrence in plants of hybridisation and whole genome duplication (WGD) and hybridization resulting in allopolyploidy. The divergence of duplicated genes following WGD creates near identical homeologues that can be problematic for <it>de novo </it>assembly and also reference based assembly protocols that use short reads (35 - 100 bp).</p> <p>Results</p> <p>Here we report a successful strategy for the assembly of two transcriptomes made using 75 bp Illumina reads from <it>Pachycladon fastigiatum </it>and <it>Pachycladon cheesemanii</it>. Both are allopolyploid plant species (2n = 20) that originated in the New Zealand Alps about 0.8 million years ago. In a systematic analysis of 19 different coverage cutoffs and 20 different k-mer sizes we showed that i) none of the genes could be assembled across all of the parameter space ii) assembly of each gene required an optimal set of parameter values and iii) these parameter values could be explained in part by different gene expression levels and different degrees of similarity between genes.</p> <p>Conclusions</p> <p>To obtain optimal transcriptome assemblies for allopolyploid plants, k-mer size and k-mer coverage need to be considered simultaneously across a broad parameter space. This is important for assembling a maximum number of full length ESTs and for avoiding chimeric assemblies of homeologous and paralogous gene copies.</p

    A Computational Model of Bacterial Population Dynamics in Gastrointestinal Yersinia enterocolitica Infections in Mice.

    Get PDF
    The complex interplay of a pathogen with its virulence and fitness factors, the host's immune response, and the endogenous microbiome determine the course and outcome of gastrointestinal infection. The expansion of a pathogen within the gastrointestinal tract implies an increased risk of developing severe systemic infections, especially in dysbiotic or immunocompromised individuals. We developed a mechanistic computational model that calculates and simulates such scenarios, based on an ordinary differential equation system, to explain the bacterial population dynamics during gastrointestinal infection. For implementing the model and estimating its parameters, oral mouse infection experiments with the enteropathogen, Yersinia enterocolitica (Ye), were carried out. Our model accounts for specific pathogen characteristics and is intended to reflect scenarios where colonization resistance, mediated by the endogenous microbiome, is lacking, or where the immune response is partially impaired. Fitting our data from experimental mouse infections, we can justify our model setup and deduce cues for further model improvement. The model is freely available, in SBML format, from the BioModels Database under the accession number MODEL2002070001

    Systematic Error in Seed Plant Phylogenomics

    Get PDF
    Resolving the closest relatives of Gnetales has been an enigmatic problem in seed plant phylogeny. The problem is known to be difficult because of the extent of divergence between this diverse group of gymnosperms and their closest phylogenetic relatives. Here, we investigate the evolutionary properties of conifer chloroplast DNA sequences. To improve taxon sampling of Cupressophyta (non-Pinaceae conifers), we report sequences from three new chloroplast (cp) genomes of Southern Hemisphere conifers. We have applied a site pattern sorting criterion to study compositional heterogeneity, heterotachy, and the fit of conifer chloroplast genome sequences to a general time reversible + G substitution model. We show that non-time reversible properties of aligned sequence positions in the chloroplast genomes of Gnetales mislead phylogenetic reconstruction of these seed plants. When 2,250 of the most varied sites in our concatenated alignment are excluded, phylogenetic analyses favor a close evolutionary relationship between the Gnetales and Pinaceae—the Gnepine hypothesis. Our analytical protocol provides a useful approach for evaluating the robustness of phylogenomic inferences. Our findings highlight the importance of goodness of fit between substitution model and data for understanding seed plant phylogeny

    Chips and tags suggest plant-environment interactions differ for two alpine Pachycladon species

    Get PDF
    BACKGROUND: Expression profiling has been proposed as a means for screening non-model organisms in their natural environments to identify genes potentially important in adaptive diversification. Tag profiling using high throughput sequencing is a relatively low cost means of expression profiling with deep coverage. However the extent to which very short cDNA sequences can be effectively used in screening for candidate genes is unclear. Here we investigate this question using an evolutionarily distant as well as a closely related transcriptome for referencing tags. We do this by comparing differentially expressed genes and processes between two closely related allopolyploid species of Pachycladon which have distinct altitudinal preferences in the New Zealand Southern Alps. We validate biological inferences against earlier microarray analyses. RESULTS: Statistical and gene annotation enrichment analyses of tag profiles identified more differentially expressed genes of potential adaptive significance than previous analyses of array-based expression profiles. These include genes involved in glucosinolate metabolism, flowering time, and response to cold, desiccation, fungi and oxidation. In addition, despite the short length of 20mer tags, we were able to infer patterns of homeologous gene expression for 700 genes in our reference library of 7,128 full-length Pachycladon ESTs. We also demonstrate that there is significant information loss when mapping tags to the non-conspecific reference transcriptome of A. thaliana as opposed to P. fastigiatum ESTs but also describe mapping strategies by which the larger collection of A. thaliana ESTs can be used as a reference. CONCLUSION: When coupled with a reference transcriptome generated using RNA-seq, tag sequencing offers a promising approach for screening natural populations and identifying candidate genes of potential adaptive significance. We identify computational issues important for the successful application of tag profiling in a non-model allopolyploid plant species

    A Longitudinal Study of the Feline Faecal Microbiome Identifies Changes into Early Adulthood Irrespective of Sexual Development.

    No full text
    Companion animals provide an excellent model for studies of the gut microbiome because potential confounders such as diet and environment can be more readily controlled for than in humans. Additionally, domestic cats and dogs are typically neutered early in life, enabling an investigation into the potential effect of sex hormones on the microbiome. In a longitudinal study to investigate the potential effects of neutering, neutering age and gender on the gut microbiome during growth, the faeces of kittens (16 male, 14 female) were sampled at 18, 30 and 42 weeks of age. DNA was shotgun sequenced on the Illumina platform and sequence reads were annotated for taxonomy and function by comparison to a database of protein coding genes. In a statistical analysis of diversity, taxonomy and functional potential of the microbiomes, age was identified as the only factor with significant associations. No significant effects were detected for gender, neutering, or age when neutered (19 or 31 weeks). At 18 weeks of age the microbiome was dominated by the genera Lactobacillus and Bifidobacterium (35% and 20% average abundance). Structural and functional diversity was significantly increased by week 30 but there was no further significant increase. At 42 weeks of age the most abundant genera were Bacteroides (16%), Prevotella (14%) and Megasphaera (8%). Significant differences in functional potential included an enrichment for genes in energy metabolism (carbon metabolism and oxidative phosphorylation) and depletion in cell motility (flagella and chemotaxis). We conclude that the feline faecal microbiome is predominantly determined by age when diet and environment are controlled for. We suggest this finding may also be informative for studies of the human microbiome, where control over such factors is usually limited

    Early canine plaque biofilms: characterization of key bacterial interactions involved in initial colonization of enamel.

    No full text
    Periodontal disease (PD) is a significant problem in dogs affecting between 44% and 63.6% of the population. The main etiological agent for PD is plaque, a microbial biofilm that colonizes teeth and causes inflammation of the gingiva. Understanding how this biofilm initiates on the tooth surface is of central importance in developing interventions against PD. Although the stages of plaque development on human teeth have been well characterized little is known about how canine plaque develops. Recent studies of the canine oral microbiome have revealed distinct differences between the canine and human oral environments and the bacterial communities they support, particularly with respect to healthy plaque. These differences mean knowledge about the nature of plaque formation in humans may not be directly translatable to dogs. The aim of this study was to identify the bacterial species important in the early stages of canine plaque formation in vivo and then use isolates of these species in a laboratory biofilm model to develop an understanding of the sequential processes which take place during the initial colonization of enamel. Supra-gingival plaque samples were collected from 12 dogs at 24 and 48 hour time points following a full mouth descale and polish. Pyrosequencing of the 16S rDNA identified 134 operational taxonomic units after statistical analysis. The species with the highest relative abundance were Bergeyella zoohelcum, Neisseria shayeganii and a Moraxella species. Streptococcal species, which tend to dominate early human plaque biofilms, had very low relative abundance. In vitro testing of biofilm formation identified five primary colonizer species, three of which belonged to the genus Neisseria. Using these pioneer bacteria as a starting point, viable two and three species communities were developed. Combining in vivo and in vitro data has led us to construct novel models of how the early canine plaque biofilm develops
    corecore