698 research outputs found

    A review of methods for the reconstruction and analysis of integrated genome-scale models of metabolism and regulation

    Get PDF
    The current survey aims to describe the main methodologies for extending the reconstruction and analysis of genome-scale metabolic models and phenotype simulation with Flux Balance Analysis mathematical frameworks, via the integration of Transcriptional Regulatory Networks and/or gene expression data. Although the surveyed methods are aimed at improving phenotype simulations obtained from these models, the perspective of reconstructing integrated genome-scale models of metabolism and gene expression for diverse prokaryotes is still an open challenge.This study was supported by the Portuguese Foundation for Science and Technology (FCT) under the scope of the strategic funding of UIDB/04 469/2020 unit and BioTecNorte operation (NORTE-01-0145-FEDER-000004) funded by the European Regional Development Fund under the scope of Norte2020 -Programa Operacional Regional do Norte. Fernando Cruz holds a doctoral fellowship (SFRH/BD/139198/2018) funded by the FCT. This study was supported by the European Commission through project SHIKIFACTORY100 -Modular cell factories for the production of 100 compounds from the shikimate pathway (Reference 814408). The submitted manuscript has been created by UChicago Argonne, LLC as Operator of Argonne National Laboratory (`Argonne') under Contract No. DE-AC02-06CH11357 with the U.S. Department of Energy. The U.S. Government retains for itself, and others acting on its behalf, a paid-up, nonexclusive, irrevocable worldwide license in said article to reproduce, prepare derivative works, distribute copies to the public, and perform publicly and display publicly, by or on behalf of the Government. The Department of Energy will provide public access to these results of federally sponsored research in accordance with the DOE Public Access Plan.info:eu-repo/semantics/publishedVersio

    Systems analysis of host-parasite interactions.

    Get PDF
    Parasitic diseases caused by protozoan pathogens lead to hundreds of thousands of deaths per year in addition to substantial suffering and socioeconomic decline for millions of people worldwide. The lack of effective vaccines coupled with the widespread emergence of drug-resistant parasites necessitates that the research community take an active role in understanding host-parasite infection biology in order to develop improved therapeutics. Recent advances in next-generation sequencing and the rapid development of publicly accessible genomic databases for many human pathogens have facilitated the application of systems biology to the study of host-parasite interactions. Over the past decade, these technologies have led to the discovery of many important biological processes governing parasitic disease. The integration and interpretation of high-throughput -omic data will undoubtedly generate extraordinary insight into host-parasite interaction networks essential to navigate the intricacies of these complex systems. As systems analysis continues to build the foundation for our understanding of host-parasite biology, this will provide the framework necessary to drive drug discovery research forward and accelerate the development of new antiparasitic therapies

    Novel Methods for Metagenomic Analysis

    Get PDF
    By sampling the genetic content of microbes at the nucleotide level, metagenomics has rapidly established itself as the standard in characterizing the taxonomic diversity and functional capacity of microbial populations throughout nature. The decreasing cost of sequencing technologies and the simultaneous increase of throughput per run has given scientists the ability to deeply sample highly diverse communities on a reasonable budget. The Human Microbiome Project is representative of the flood of sequence data that will arrive in the coming years. Despite these advancements, there remains the significant challenge of analyzing massive metagenomic datasets to make appropriate biological conclusions. This dissertation is a collection of novel methods developed for improved analysis of metagenomic data: (1) We begin with Figaro, a statistical algorithm that quickly and accurately infers and trims vector sequence from large Sanger-based read sets without prior knowledge of the vector used in library construction. (2) Next, we perform a rigorous evaluation of methodologies used to cluster environmental 16S rRNA sequences into species-level operational taxonomic units, and discover that many published studies utilize highly stringent parameters, resulting in overestimation of microbial diversity. (3) To assist in comparative metagenomics studies, we have created Metastats, a robust statistical methodology for comparing large-scale clinical datasets with up to thousands of subjects. Given a collection of annotated metagenomic features (e.g. taxa, COGs, or pathways), Metastats determines which features are differentially abundant between two populations. (4) Finally, we report on a new methodology that employs the generalized Lotka-Volterra model to infer microbe-microbe interactions from longitudinal 16S rRNA data. It is our hope that these methods will enhance standard metagenomic analysis techniques to provide better insight into the human microbiome and microbial communities throughout our world. To assist metagenomics researchers and those developing methods, all software described in this thesis is open-source and available online

    Proteome characterizations of microbial systems using MS-based experimental and informatics approaches to examine key metabolic pathways, proteins of unknown function, and phenotypic adaptation

    Get PDF
    Microbes express complex phenotypes and coordinate activities to build microbial communities. Recent work has focused on understanding the ability of microbial systems to efficiently utilize cellulosic biomass to produce bioenergy-related products. In order to maximize the yield of these bioenergy-related products from a microbial system, it is necessary to understand the molecular mechanisms.The ability of mass spectrometry to precisely identify thousands of proteins from a bacterial source has established mass spectrometry-based proteomics as an indispensable tool for various biological disciplines. This dissertation developed and optimized various proteomics experimental and informatic protocols, and integrated the resulting data with metabolomics, transcriptomics, and genomics in order to understand the systems biology of bio-energy relevant organisms. Integration of these various omics technologies led to an improved understanding of microbial cell-to-cell communication in response to external stimuli, microbial adaptation during deconstruction of lignocellulosic biomass and proteome diversity when an organism is subjected to different growth conditions.Integrated omics revealed Clostridium thermocellum\u27s accumulate long-chain, branched fatty acids over time in response to cytotoxic inhibitors released during the deconstruction and utilization of switchgrass. A striking feature implies a restructuring of C. thermocellum\u27s cellular membrane as the culture progresses. The membrane remodulation was further examined in a study involving the swarming and swimming phenotypes of Paenibacillus polymyxa. The possible roles of phospholipids, hydrolytic enzymes, surfactin, flagellar assembly, chemotaxis and glycerol metabolism in swarming motility were investigated by integrating lipidomics with proteomics.Extracellular proteome analysis of Caldicellulosiruptor bescii revealed secretome plasticity based on the complexity (mono-/disaccharides vs. polysaccharides) and type of carbon (C5 vs. C6) available to the microorganism. This study further opened the avenue for research to characterize proteins of unknown function (PUFs) specific to growth conditions.To gain a better understanding of the possible functions of PUFs in C. thermocellum, a time course analysis of C. thermocellum was conducted. Based on the concept of guilt-by-association, protein intensities and their co-expressions were used to tease out the functional aspect of PUFs. Clustering trends and network analysis were used to infer potential functions of PUFs. Selected PUFs were further interrogated by the use of phylogeny and structural modeling

    MOLECULAR AND ECOLOGICAL ASPECTS OF THE INTERACTIONS BETWEEN \u3ci\u3eAUREOCOCCUS ANOPHAGEFFERENS\u3c/i\u3e AND ITS GIANT VIRUS

    Get PDF
    Viruses are increasingly being recognized as an important biotic component of all ecosystems including agents that control the rapid ecological events that are harmful algal blooms (HABS). Aureococcus anophagefferens is a pelagophyte which causes recurrent ecosystem devastating brown tide blooms along the east coast of the USA and has recently spread to China and South Africa. It has been suggested that a large virus (AaV) is possibly an important agent for demise of brown tide blooms. This observation is consistent with the recognition of a number of other giant viruses modulating algal blooms in marine systems. In this dissertation, we investigated both the molecular underpinnings of Aureococcus-AaV interactions and the dynamics of AaV and the associated viral community in situ. We determined the genome sequence and phylogenetic history of AaV using high throughput sequencing approach and revealed it’s intertwined evolutionary history with the host and other organisms. Building upon the available genome of AaV and its host, we took an RNA-seq approach to provide insights on the physiological state of the AaV-infected Aureococcus ‘virocell’ that is geared towards virus production. In situ activity of AaV was detected by targeted amplicon and high throughput community RNA sequencing (metatranscriptomics) from Quantuck Bay, NY, a site with recurrent brown tide blooms. AaV and associated giant algal viruses in the Mimiviridae clade were found to respond to environmental changes, indicating that this newly recognized phylogenetic group is an important contributor to the eukaryotic phytoplankton dynamics. Analyzing time series metatranscriptomics from two distinct coastal sites recovered diverse viruses infecting microeukaryotes (including AaV) as part of interacting networks of viruses and microeukaryotes. Results from these studies testify AaV as an important factor for brown tide bloom demise, reveals the molecular underpinnings of AaV-host interactions and establishes the ecological relevance of Mimivirus-like algal viruses. We also provide foundation for using metatranscriptomics as an important tool in marine virus ecology – capable of recovering associations among coexisting marine microeukaryotes and viruses

    Predicting Cellular Growth from Gene Expression Signatures

    Get PDF
    Maintaining balanced growth in a changing environment is a fundamental systems-level challenge for cellular physiology, particularly in microorganisms. While the complete set of regulatory and functional pathways supporting growth and cellular proliferation are not yet known, portions of them are well understood. In particular, cellular proliferation is governed by mechanisms that are highly conserved from unicellular to multicellular organisms, and the disruption of these processes in metazoans is a major factor in the development of cancer. In this paper, we develop statistical methodology to identify quantitative aspects of the regulatory mechanisms underlying cellular proliferation in Saccharomyces cerevisiae. We find that the expression levels of a small set of genes can be exploited to predict the instantaneous growth rate of any cellular culture with high accuracy. The predictions obtained in this fashion are robust to changing biological conditions, experimental methods, and technological platforms. The proposed model is also effective in predicting growth rates for the related yeast Saccharomyces bayanus and the highly diverged yeast Schizosaccharomyces pombe, suggesting that the underlying regulatory signature is conserved across a wide range of unicellular evolution. We investigate the biological significance of the gene expression signature that the predictions are based upon from multiple perspectives: by perturbing the regulatory network through the Ras/PKA pathway, observing strong upregulation of growth rate even in the absence of appropriate nutrients, and discovering putative transcription factor binding sites, observing enrichment in growth-correlated genes. More broadly, the proposed methodology enables biological insights about growth at an instantaneous time scale, inaccessible by direct experimental methods. Data and tools enabling others to apply our methods are available at http://function.princeton.edu/growthrate

    On the effects of alternative optima in context-specific metabolic model predictions

    Full text link
    Recent methodological developments have facilitated the integration of high-throughput data into genome-scale models to obtain context-specific metabolic reconstructions. A unique solution to this data integration problem often may not be guaranteed, leading to a multitude of context-specific predictions equally concordant with the integrated data. Yet, little attention has been paid to the alternative optima resulting from the integration of context-specific data. Here we present computational approaches to analyze alternative optima for different context-specific data integration instances. By using these approaches on metabolic reconstructions for the leaf of Arabidopsis thaliana and the human liver, we show that the analysis of alternative optima is key to adequately evaluating the specificity of the predictions in particular cellular contexts. While we provide several ways to reduce the ambiguity in the context-specific predictions, our findings indicate that the existence of alternative optimal solutions warrant caution in detailed context-specific analyses of metabolism
    • …
    corecore