289 research outputs found

    Copy number variation signature to predict human ancestry

    Get PDF
    Abstract Background Copy number variations (CNVs) are genomic structural variants that are found in healthy populations and have been observed to be associated with disease susceptibility. Existing methods for CNV detection are often performed on a sample-by-sample basis, which is not ideal for large datasets where common CNVs must be estimated by comparing the frequency of CNVs in the individual samples. Here we describe a simple and novel approach to locate genome-wide CNVs common to a specific population, using human ancestry as the phenotype. Results We utilized our previously published Genome Alteration Detection Analysis (GADA) algorithm to identify common ancestry CNVs (caCNVs) and built a caCNV model to predict population structure. We identified a 73 caCNV signature using a training set of 225 healthy individuals from European, Asian, and African ancestry. The signature was validated on an independent test set of 300 individuals with similar ancestral background. The error rate in predicting ancestry in this test set was 2% using the 73 caCNV signature. Among the caCNVs identified, several were previously confirmed experimentally to vary by ancestry. Our signature also contains a caCNV region with a single microRNA (MIR270), which represents the first reported variation of microRNA by ancestry. Conclusions We developed a new methodology to identify common CNVs and demonstrated its performance by building a caCNV signature to predict human ancestry with high accuracy. The utility of our approach could be extended to large case–control studies to identify CNV signatures for other phenotypes such as disease susceptibility and drug response

    Lead Modulates trans- and cis-Expression Quantitative Trait Loci (eQTLs) in Drosophila melanogaster Heads

    Get PDF
    Lead exposure has long been one of the most important topics in global public health because it is a potent developmental neurotoxin. Here, an eQTL analysis, which is the genome-wide association analysis of genetic variants with gene expression, was performed. In this analysis, the male heads of 79 Drosophila melanogaster inbred lines from Drosophila Synthetic Population Resource (DSPR) were treated with or without developmental exposure, from hatching to adults, to 250 ÎŒM lead acetate [Pb(C2H3O2)2]. The goal was to identify genomic intervals that influence the gene-expression response to lead. After detecting 1798 cis-eQTLs and performing an initial trans-eQTL analysis, we focused our analysis on lead-sensitive “trans-eQTL hotspots,” defined as genomic regions that are associated with a cluster of genes in a lead-dependent manner. We noticed that the genes associated with one of the 14 detected trans-eQTL hotspots, Chr 2L: 6,250,000 could be roughly divided into two groups based on their differential expression profile patterns and different categories of function. This trans-eQTL hotspot validates one identified in a previous study using different recombinant inbred lines. The expression of all the associated genes in the trans-eQTL hotspot was visualized with hierarchical clustering analysis. Besides the overall expression profile patterns, the heatmap displayed the segregation of differential parental genetic contributions. This suggested that trans-regulatory regions with different genetic contributions from the parental lines have significantly different expression changes after lead exposure. We believe this study confirms our earlier study, and provides important insights to unravel the genetic variation in lead susceptibility in Drosophila model

    Copy number variation in the Framingham Heart Study

    Get PDF
    In this paper we test for association between copy number variation and diabetes in a subset of individuals from the Framingham Heart Study. We used the 500 k SNP data and called copy number variation using two algorithms: the genome alteration detection algorithm of Pique-Regi et al. and the software Golden Helix. We then tested for association between copy number and diabetes using a gene-based analysis. Our results show little evidence of association between copy number and diabetes status. Furthermore, our results indicate a relatively poor level of agreement between copy number calls resulting from the two programs. We then examined potential causes for this difference in results and the implications for future studies

    R-Gada: a fast and flexible pipeline for copy number analysis in association studies

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Genome-wide association studies (GWAS) using Copy Number Variation (CNV) are becoming a central focus of genetic research. CNVs have successfully provided target genome regions for some disease conditions where simple genetic variation (i.e., SNPs) has previously failed to provide a clear association.</p> <p>Results</p> <p>Here we present a new R package, that integrates: (i) data import from most common formats of Affymetrix, Illumina and aCGH arrays; (ii) a fast and accurate segmentation algorithm to call CNVs based on Genome Alteration Detection Analysis (GADA); and (iii) functions for displaying and exporting the Copy Number calls, identification of recurrent CNVs, multivariate analysis of population structure, and tools for performing association studies. Using a large dataset containing 270 HapMap individuals (Affymetrix Human SNP Array 6.0 Sample Dataset) we demonstrate a flexible pipeline implemented with the package. It requires less than one minute per sample (3 million probe arrays) on a single core computer, and provides a flexible parallelization for very large datasets. Case-control data were generated from the HapMap dataset to demonstrate a GWAS analysis.</p> <p>Conclusions</p> <p>The package provides the tools for creating a complete integrated pipeline from data normalization to statistical association. It can effciently handle a massive volume of data consisting of millions of genetic markers and hundreds or thousands of samples with very accurate results.</p

    Genetic and Transcriptional Analysis of Human Host Response to Healthy Gut Microbiota

    Get PDF
    Many studies have demonstrated the importance of the gut microbiota in healthy and disease states. However, establishing the causality of host-microbiota interactions in humans is still challenging. Here, we describe a novel experimental system to define the transcriptional response induced by the microbiota for human cells and to shed light on the molecular mechanisms underlying host-gut microbiota interactions. In primary human colonic epithelial cells, we identified over 6,000 genes whose expression changed at various time points following coculturing with the gut microbiota of a healthy individual. Among the differentially expressed genes we found a 1.8-fold enrichment of genes associated with diseases that have been previously linked to the microbiome, such as obesity and colorectal cancer. In addition, our experimental system allowed us to identify 87 host single nucleotide polymorphisms (SNPs) that show allele-specific expression in 69 genes. Furthermore, for 12 SNPs in 12 different genes, allele-specific expression is conditional on the exposure to the microbiota. Of these 12 genes, 8 have been associated with diseases linked to the gut microbiota, specifically colorectal cancer, obesity, and type 2 diabetes. Our study demonstrates a scalable approach to study host-gut microbiota interactions and can be used to identify putative mechanisms for the interplay between host genetics and the microbiota in health and disease

    The amniotic fluid cell-free transcriptome in spontaneous preterm labor

    Get PDF
    The amniotic fluid (AF) cell-free RNA was shown to reflect physiological and pathological processes in pregnancy, but its value in the prediction of spontaneous preterm delivery is unknown. Herein we profiled cell-free RNA in AF samples collected from women who underwent transabdominal amniocentesis after an episode of spontaneous preterm labor and subsequently delivered within 24 h (n = 10) or later (n = 28) in gestation. Expression of known placental single-cell RNA-Seq signatures was quantified in AF cell-free RNA and compared between the groups. Random forest models were applied to predict time-to-delivery after amniocentesis. There were 2385 genes differentially expressed in AF samples of women who delivered within 24 h of amniocentesis compared to gestational age-matched samples from women who delivered after 24 h of amniocentesis. Genes with cell-free RNA changes were associated with immune and inflammatory processes related to the onset of labor, and the expression of placental single-cell RNA-Seq signatures of immune cells was increased with imminent delivery. AF transcriptomic prediction models captured these effects and predicted delivery within 24 h of amniocentesis (AUROC = 0.81). These results may inform the development of biomarkers for spontaneous preterm birth

    A fast and accurate method to detect allelic genomic imbalances underlying mosaic rearrangements using SNP array data

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Mosaicism for copy number and copy neutral chromosomal rearrangements has been recently identified as a relatively common source of genetic variation in the normal population. However its prevalence is poorly defined since it has been only studied systematically in one large-scale study and by using non optimal <it>ad-hoc </it>SNP array data analysis tools, uncovering rather large alterations (> 1 Mb) and affecting a high proportion of cells. Here we propose a novel methodology, Mosaic Alteration Detection-MAD, by providing a software tool that is effective for capturing previously described alterations as wells as new variants that are smaller in size and/or affecting a low percentage of cells.</p> <p>Results</p> <p>The developed method identified all previously known mosaic abnormalities reported in SNP array data obtained from controls, bladder cancer and HapMap individuals. In addition MAD tool was able to detect new mosaic variants not reported before that were smaller in size and with lower percentage of cells affected. The performance of the tool was analysed by studying simulated data for different scenarios. Our method showed high sensitivity and specificity for all assessed scenarios.</p> <p>Conclusions</p> <p>The tool presented here has the ability to identify mosaic abnormalities with high sensitivity and specificity. Our results confirm the lack of sensitivity of former methods by identifying new mosaic variants not reported in previously utilised datasets. Our work suggests that the prevalence of mosaic alterations could be higher than initially thought. The use of appropriate SNP array data analysis methods would help in defining the human genome mosaic map.</p

    RNA Sequencing Reveals Diverse Functions of Amniotic Fluid Neutrophils and Monocytes/Macrophages in Intra-Amniotic Infection

    Get PDF
    Intra-amniotic infection, the invasion of microbes into the amniotic cavity resulting in inflammation, is a clinical condition that can lead to adverse pregnancy outcomes for the mother and fetus as well as severe long-term neonatal morbidities. Despite much research focused on the consequences of intra-amniotic infection, there remains little knowledge about the innate immune cells that respond to invading microbes. We performed RNA-seq of sorted amniotic fluid neutrophils and monocytes/macrophages from women with intra-amniotic infection to determine the transcriptomic differences between these innate immune cells. Further, we sought to identify specific transcriptomic pathways that were significantly altered by the maternal or fetal origin of amniotic fluid neutrophils and monocytes/macrophages, the presence of a severe fetal inflammatory response, and pregnancy outcome (i.e., preterm or term delivery). We show that significant transcriptomic differences exist between amniotic fluid neutrophils and monocytes/macrophages from women with intra-amniotic infection, indicating the distinct roles these cells play. The transcriptome of amniotic fluid immune cells varies based on their maternal or fetal origin, and the significant transcriptomic differences between fetal and maternal monocytes/macrophages imply that those of fetal origin exhibit impaired functions. Notably, transcriptomic changes in amniotic fluid monocytes/macrophages suggest that these immune cells collaborate with neutrophils in the trafficking of fetal leukocytes throughout the umbilical cord (i.e., funisitis). Finally, amniotic fluid neutrophils and monocytes/macrophages from preterm deliveries display enhanced transcriptional activity compared to those from term deliveries, highlighting the protective role of these cells during this vulnerable period. Collectively, these findings demonstrate the underlying complexity of local innate immune responses in women with intra-amniotic infection and provide new insights into the functions of neutrophils and monocytes/macrophages in the amniotic cavity. Drug Dosage: The authors and the publisher have exerted every effort to ensure that drug selection and dosage set forth in this text are in accord with current recommendations and practice at the time of publication. However, in view of ongoing research, changes in government regulations, and the constant flow of information relating to drug therapy and drug reactions, the reader is urged to check the package insert for each drug for any changes in indications and dosage and for added warnings and precautions. This is particularly important when the recommended agent is a new and/or infrequently employed drug. Disclaimer: The statements, opinions and data contained in this publication are solely those of the individual authors and contributors and not of the publishers and the editor(s). The appearance of advertisements or/and product references in the publication is not a warranty, endorsement, or approval of the products or services advertised or of their effectiveness, quality or safety. The publisher and the editor(s) disclaim responsibility for any injury to persons or property resulting from any ideas, methods, instructions or products referred to in the content or advertisements
    • 

    corecore