6 research outputs found
Interactions between species introduce spurious associations in microbiome studies
Microbiota contribute to many dimensions of host phenotype, including
disease. To link specific microbes to specific phenotypes, microbiome-wide
association studies compare microbial abundances between two groups of samples.
Abundance differences, however, reflect not only direct associations with the
phenotype, but also indirect effects due to microbial interactions. We found
that microbial interactions could easily generate a large number of spurious
associations that provide no mechanistic insight. Using techniques from
statistical physics, we developed a method to remove indirect associations and
applied it to the largest dataset on pediatric inflammatory bowel disease. Our
method corrected the inflation of p-values in standard association tests and
showed that only a small subset of associations is directly linked to the
disease. Direct associations had a much higher accuracy in separating cases
from controls and pointed to immunomodulation, butyrate production, and the
brain-gut axis as important factors in the inflammatory bowel disease.Comment: 4 main text figures, 15 supplementary figures (i.e appendix) and 6
supplementary tables. Overall 49 pages including reference
An introduction to the maximum entropy approach and its application to inference problems in biology
A cornerstone of statistical inference, the maximum entropy framework is being increasingly applied to construct descriptive and predictive models of biological systems, especially complex biological networks, from large experimental data sets. Both its broad applicability and the success it obtained in different contexts hinge upon its conceptual simplicity and mathematical soundness. Here we try to concisely review the basic elements of the maximum entropy principle, starting from the notion of âentropyâ, and describe its usefulness for the analysis of biological systems. As examples, we focus specifically on the problem of reconstructing gene interaction networks from expression data and on recent work attempting to expand our system-level understanding of bacterial metabolism. Finally, we highlight some extensions and potential limitations of the maximum entropy approach, and point to more recent developments that are likely to play a key role in the upcoming challenges of extracting structures and information from increasingly rich, high-throughput biological data
Network topology and community function in spatial microbial communities
Complex communities of microbes act collectively to regulate human health, provide sources of clean energy, and ripen aromatic cheese. The efficient functioning of these communities can be directly related to competitive and cooperative interactions between
species. Physical constraints and local environment affect the stability of these interactions. Here we explore the role of spatial habitat and interaction networks in microbial ecology and human disease.
In the first part of the dissertation, we model mutualism to understand how spatial microbial communities survive number fluctuations in physical habitats. We explicitly account for the production, consumption, and diffusion of public goods in a two-species microbial community. We show that increased sharing of nutrients breaks down coexistence, and that species may benefit from making slower-diffusing nutrients. In multi-species communities, indirect and higher order interactions may affect community function. We find that the requirement for spatial proximity severely restricts the network of possible microbial interactions. While cooperation between two
species is stable, higher-order mutualism requiring three or more species succumbs easily to number fluctuations. Additional cyclic or reciprocal interactions between pairs can stabilize multi-species communities.
Inter-species interactions also affect human health via the human microbiome: microbial communities in the gut, lungs and skin. In the second part of the dissertation, we use machine learning and statistics to establish links between microbiota abundance and composition, and the incidence of chronic diseases. We study the gut fungal profile to probe the effects of diet and fungal dysbiosis in a cohort of Saudi children with Crohn's disease.
While statistical microbiome studies established that each disease phenotype is associated with a distinct state of intestinal dysbiosis, they often produced conflicting results and identified a very large number of microbes associated with disease. We show that a handful of taxa could drive the dynamics of ecosystem-level abundance changes due to strong inter-species interactions. Using maximum entropy methods, we propose a simple statistical approach (Direct Association Analysis or DAA) to account for interspecific interactions. When applied to the largest dataset on IBD, DAA detects a small subset of associations directly linked to the disease, avoids p-value
inflation and identifies most predictive features of the microbiome
Variable habitat conditions drive species covariation in the human microbiota
International audienceTwo species with similar resource requirements respond in a characteristic way to variations in their habitatâtheir abundances rise and fall in concert. We use this idea to learn how bacterial populations in the microbiota respond to habitat conditions that vary from person-to-person across the human population. Our mathematical framework shows that habitat fluctuations are sufficient for explaining intra-bodysite correlations in relative species abundances from the Human Microbiome Project. We explicitly show that the relative abundances of closely related species are positively correlated and can be predicted from taxonomic relationships. We identify a small set of functional pathways related to metabolism and maintenance of the cell wall that form the basis of a common resource sharing niche space of the human microbiota