49 research outputs found

    Disparities in allele frequencies and population differentiation for 101 disease-associated single nucleotide polymorphisms between Puerto Ricans and Non-Hispanic Whites

    Get PDF
    BACKGROUND. Variations in gene allele frequencies can contribute to differences in the prevalence of some common complex diseases among populations. Natural selection modulates the balance in allele frequencies across populations. Population differentiation (FST) can evidence environmental selection pressures. Such genetic information is limited in Puerto Ricans, the second largest Hispanic ethnic group in the US, and a group with high prevalence of chronic disease. We determined allele frequencies and population differentiation for 101 single nucleotide polymorphisms (SNPs) in 30 genes involved in major metabolic and disease-relevant pathways in Puerto Ricans (n = 969, ages 45–75 years) and compared them to similarly aged non-Hispanic whites (NHW) (n = 597). RESULTS. Minor allele frequency (MAF) distributions for 45.5% of the SNPs assessed in Puerto Ricans were significantly different from those of NHW. Puerto Ricans carried risk alleles in higher frequency and protective alleles in lower frequency than NHW. Patterns of population differentiation showed that Puerto Ricans had SNPs with exceptional FST values in intronic, non-synonymous and promoter regions. NHW had exceptional FST values in intronic and promoter region SNPs only. CONCLUSION. These observations may serve to explain and broaden studies on the impact of gene polymorphisms on chronic diseases affecting Puerto Ricans.National Institutes of Health, National Institutes on Aging (P01AG02394, P01AG023394-SI); National Insitutes of Health (53-K06-5-10); US Department of Agriculture Research Service (58-1950-9-001, 58-1950-7-707); National Institutes of Health & Heart, Lung, and Blood Institute (U 01 HL72524, Genetic and Environmental Determinants of Triglycerides, HL54776

    Targeted next-generation sequencing of a cancer transcriptome enhances detection of sequence variants and novel fusion transcripts

    Get PDF
    Combining next-generation sequencing with capture of sequences from a relevant subset of a transcriptome produces an enhanced view of this subse

    Massively Parallel Sequencing of Human Urinary Exosome/Microvesicle RNA Reveals a Predominance of Non-Coding RNA

    Get PDF
    Intact RNA from exosomes/microvesicles (collectively referred to as microvesicles) has sparked much interest as potential biomarkers for the non-invasive analysis of disease. Here we use the Illumina Genome Analyzer to determine the comprehensive array of nucleic acid reads present in urinary microvesicles. Extraneous nucleic acids were digested using RNase and DNase treatment and the microvesicle inner nucleic acid cargo was analyzed with and without DNase digestion to examine both DNA and RNA sequences contained in microvesicles. Results revealed that a substantial proportion (∼87%) of reads aligned to ribosomal RNA. Of the non-ribosomal RNA sequences, ∼60% aligned to non-coding RNA and repeat sequences including LINE, SINE, satellite repeats, and RNA repeats (tRNA, snRNA, scRNA and srpRNA). The remaining ∼40% of non-ribosomal RNA reads aligned to protein coding genes and splice sites encompassing approximately 13,500 of the known 21,892 protein coding genes of the human genome. Analysis of protein coding genes specific to the renal and genitourinary tract revealed that complete segments of the renal nephron and collecting duct as well as genes indicative of the bladder and prostate could be identified. This study reveals that the entire genitourinary system may be mapped using microvesicle transcript analysis and that the majority of non-ribosomal RNA sequences contained in microvesicles is potentially functional non-coding RNA, which play an emerging role in cell regulation

    Strand-specific RNA sequencing reveals extensive regulated long antisense transcripts that are conserved across yeast species

    Get PDF
    Background Recent studies in budding yeast have shown that antisense transcription occurs at many loci. However, the functional role of antisense transcripts has been demonstrated only in a few cases and it has been suggested that most antisense transcripts may result from promiscuous bi-directional transcription in a dense genome. Results Here, we use strand-specific RNA sequencing to study anti-sense transcription in Saccharomyces cerevisiae. We detect 1,103 putative antisense transcripts expressed in mid-log phase growth, ranging from 39 short transcripts covering only the 3' UTR of sense genes to 145 long transcripts covering the entire sense open reading frame. Many of these antisense transcripts overlap sense genes that are repressed in mid-log phase and are important in stationary phase, stress response, or meiosis. We validate the differential regulation of 67 antisense transcripts and their sense targets in relevant conditions, including nutrient limitation and environmental stresses. Moreover, we show that several antisense transcripts and, in some cases, their differential expression have been conserved across five species of yeast spanning 150 million years of evolution. Divergence in the regulation of antisense transcripts to two respiratory genes coincides with the evolution of respiro-fermentation. Conclusions Our work provides support for a global and conserved role for antisense transcription in yeast gene regulation.Canadian Friends of the Hebrew UniversityHoward Hughes Medical InstituteHuman Frontier Science Program (Strasbourg, France)Burroughs Wellcome Fund (Career Award at the Scientific Interface)National Institutes of Health (U.S.). Pioneer AwardBroad Institute of MIT and HarvardU.S.-Israel Binational Science Foundation (BSF)National Human Genome Research Institute (U.S.)Alfred P. Sloan Foundatio

    Dietary Intake of n-6 Fatty Acids Modulates Effect of Apolipoprotein A5 Gene on Plasma Fasting Triglycerides, Remnant Lipoprotein Concentrations, and Lipoprotein Particle Size

    Get PDF
    Background— Apolipoprotein A5 gene (APOA5) variation is associated with plasma triglycerides (TGs). However, little is known about whether dietary fat modulates this association. Methods and Results— We investigated the interaction between APOA5 gene variation and dietary fat in determining plasma fasting TGs, remnant-like particle (RLP) concentrations, and lipoprotein particle size in 1001 men and 1147 women who were Framingham Heart Study participants. Polymorphisms –1131T>C and 56C>G, representing 2 independent haplotypes, were analyzed. Significant gene–diet interactions between the –1131T>C polymorphism and polyunsaturated fatty acid (PUFA) intake were found (PG polymorphism. The –1131C allele was associated with higher fasting TGs and RLP concentrations (P6% of total energy). No heterogeneity by sex was found. These interactions showed a dose-response effect when PUFA intake was considered as a continuous variable (P<0.01). Similar interactions were found for the sizes of VLDL and LDL particles. Only in carriers of the –1131C allele did the size of these particles increase (VLDL) or decrease (LDL) as PUFA intake increased (P<0.01). We further analyzed the effects of n-6 and n-3 fatty acids and found that the PUFA–APOA5 interactions were specific for dietary n-6 fatty acids. Conclusions— Higher n-6 (but not n-3) PUFA intake increased fasting TGs, RLP concentrations, and VLDL size and decreased LDL size in APOA5 –1131C carriers, suggesting that n-6 PUFA–rich diets are related to a more atherogenic lipid profile in these subjects.Corella Piquer, Maria Dolores, [email protected]

    Comprehensive Classification of Retinal Bipolar Neurons by Single-Cell Transcriptomics

    Get PDF
    Patterns of gene expression can be used to characterize and classify neuronal types. It is challenging, however, to generate taxonomies that fulfill the essential criteria of being comprehensive, harmonizing with conventional classification schemes, and lacking superfluous subdivisions of genuine types. To address these challenges, we used massively parallel single-cell RNA profiling and optimized computational methods on a heterogeneous class of neurons, mouse retinal bipolar cells (BCs). From a population of ∼25,000 BCs, we derived a molecular classification that identified 15 types, including all types observed previously and two novel types, one of which has a non-canonical morphology and position. We validated the classification scheme and identified dozens of novel markers using methods that match molecular expression to cell morphology. This work provides a systematic methodology for achieving comprehensive molecular classification of neurons, identifies novel neuronal types, and uncovers transcriptional differences that distinguish types within a class

    Development and evaluation of RNA-seq methods

    Get PDF
    This article is part of the supplement: Beyond the Genome: The true gene count, human evolution and disease genomicsRNA-seq provides insights at multiple levels into the transcription of the genome as it yields sequence, splicing and expression-level information. We have been developing and comparing a wide range of RNA-seq methods for their ability to annotate transcribed genomic regions, identify differences between normal and cancer states, and quantify mRNA expression levels. We will present results in two areas: (i) strand-specific RNA-seq; and (ii) RNA-seq starting from total RNA. Strand-specific RNA-seq is a powerful tool for novel transcript discovery and genome annotation because it enables the identification of the strand of origin for non-coding RNA and anti-sense RNA, as well as defining the ends of adjacent or overlapping transcripts transcribed in different directions. Using the well-annotated Saccharomyces cerevisiae transcriptome as a benchmark, we directly compared seven library construction protocols, including both published and our own novel methods. We found marked differences in strand-specificity, library complexity, evenness and continuity of coverage, agreement with known annotations, and accuracy for expression profiling. Weighing the performance and ease of conducting each method, we identified the dUTP second strand marking [1] and the Illumina RNA ligation methods as the leading protocols, with the former benefitting from the current availability of paired-end sequencing. Our analysis provides a comprehensive benchmark, and our computational pipeline is applicable for assessment of future protocols in other organisms. RNA-seq methods that do not require the purification of mRNA will be valuable for several applications, including samples with low input amounts and/or partial degradation. In these experiments, it is necessary to reduce the fraction of sequencing reads derived from ribosomal RNA. We will present results from multiple approaches, including the use of Not-So-Random (NSR) primers for reverse transcription [2] and the NuGEN Ovation RNA-Seq kit

    Single-cell transcriptomics reveals bimodality in expression and splicing in immune cells

    Get PDF
    Recent molecular studies have shown that, even when derived from a seemingly homogenous population, individual cells can exhibit substantial differences in gene expression, protein levels and phenotypic output1, 2, 3, 4, 5, with important functional consequences4, 5. Existing studies of cellular heterogeneity, however, have typically measured only a few pre-selected RNAs1, 2 or proteins5, 6 simultaneously, because genomic profiling methods3 could not be applied to single cells until very recently7, 8, 9, 10. Here we use single-cell RNA sequencing to investigate heterogeneity in the response of mouse bone-marrow-derived dendritic cells (BMDCs) to lipopolysaccharide. We find extensive, and previously unobserved, bimodal variation in messenger RNA abundance and splicing patterns, which we validate by RNA-fluorescence in situ hybridization for select transcripts. In particular, hundreds of key immune genes are bimodally expressed across cells, surprisingly even for genes that are very highly expressed at the population average. Moreover, splicing patterns demonstrate previously unobserved levels of heterogeneity between cells. Some of the observed bimodality can be attributed to closely related, yet distinct, known maturity states of BMDCs; other portions reflect differences in the usage of key regulatory circuits. For example, we identify a module of 137 highly variable, yet co-regulated, antiviral response genes. Using cells from knockout mice, we show that variability in this module may be propagated through an interferon feedback circuit, involving the transcriptional regulators Stat2 and Irf7. Our study demonstrates the power and promise of single-cell genomics in uncovering functional diversity between cells and in deciphering cell states and circuits.National Institutes of Health (U.S.) (NIH Postdoctoral Fellowship (1F32HD075541-01))Charles H. Hood Foundation (Postdoctoral Fellowship)National Institutes of Health (U.S.) (NIH grant U54 AI057159)National Institutes of Health (U.S.) (NIH New Innovator Award (DP2 OD002230))National Institutes of Health (U.S.) (NIH CEGS Award (1P50HG006193-01))National Institutes of Health (U.S.) (NIH Pioneer Awards (5DP1OD003893-03))National Institutes of Health (U.S.) (NIH Pioneer Awards (DP1OD003958-01))Broad Institute of MIT and HarvardBroad Institute of MIT and Harvard (Klarman Cell Observatory

    Comprehensive comparative analysis of strand-specific RNA sequencing methods

    Get PDF
    Strand-specific, massively parallel cDNA sequencing (RNA-seq) is a powerful tool for transcript discovery, genome annotation and expression profiling. There are multiple published methods for strand-specific RNA-seq, but no consensus exists as to how to choose between them. Here we developed a comprehensive computational pipeline to compare library quality metrics from any RNA-seq method. Using the well-annotated Saccharomyces cerevisiae transcriptome as a benchmark, we compared seven library-construction protocols, including both published and our own methods. We found marked differences in strand specificity, library complexity, evenness and continuity of coverage, agreement with known annotations and accuracy for expression profiling. Weighing each method's performance and ease, we identified the dUTP second-strand marking and the Illumina RNA ligation methods as the leading protocols, with the former benefitting from the current availability of paired-end sequencing. Our analysis provides a comprehensive benchmark, and our computational pipeline is applicable for assessment of future protocols in other organisms.Howard Hughes Medical InstituteUnited States-Israel Binational Science Foundatio
    corecore