66 research outputs found
Bioconductor: open software development for computational biology and bioinformatics.
The Bioconductor project is an initiative for the collaborative creation of extensible software for computational biology and bioinformatics. The goals of the project include: fostering collaborative development and widespread use of innovative software, reducing barriers to entry into interdisciplinary scientific research, and promoting the achievement of remote reproducibility of research results. We describe details of our aims and methods, identify current challenges, compare Bioconductor to other open bioinformatics projects, and provide working examples
Recommended from our members
Whole-exome sequencing and clinical interpretation of FFPE tumor samples to guide precision cancer medicine
Translating whole exome sequencing (WES) for prospective clinical use may impact the care of cancer patients; however, multiple innovations are necessary for clinical implementation. These include: (1) rapid and robust WES from formalin-fixed paraffin embedded (FFPE) tumor tissue, (2) analytical output similar to data from frozen samples, and (3) clinical interpretation of WES data for prospective use. Here, we describe a prospective clinical WES platform for archival FFPE tumor samples. The platform employs computational methods for effective clinical analysis and interpretation of WES data. When applied retrospectively to 511 exomes, the interpretative framework revealed a “long tail” of somatic alterations in clinically important genes. Prospective application of this approach identified clinically relevant alterations in 15/16 patients. In one patient, previously undetected findings guided clinical trial enrollment leading to an objective clinical response. Overall, this methodology may inform the widespread implementation of precision cancer medicine
The effect of LRRK2 loss-of-function variants in humans
Analysis of large genomic datasets, including gnomAD, reveals that partial LRRK2 loss of function is not strongly associated with diseases, serving as an example of how human genetics can be leveraged for target validation in drug discovery. Human genetic variants predicted to cause loss-of-function of protein-coding genes (pLoF variants) provide natural in vivo models of human gene inactivation and can be valuable indicators of gene function and the potential toxicity of therapeutic inhibitors targeting these genes(1,2). Gain-of-kinase-function variants in LRRK2 are known to significantly increase the risk of Parkinson's disease(3,4), suggesting that inhibition of LRRK2 kinase activity is a promising therapeutic strategy. While preclinical studies in model organisms have raised some on-target toxicity concerns(5-8), the biological consequences of LRRK2 inhibition have not been well characterized in humans. Here, we systematically analyze pLoF variants in LRRK2 observed across 141,456 individuals sequenced in the Genome Aggregation Database (gnomAD)(9), 49,960 exome-sequenced individuals from the UK Biobank and over 4 million participants in the 23andMe genotyped dataset. After stringent variant curation, we identify 1,455 individuals with high-confidence pLoF variants in LRRK2. Experimental validation of three variants, combined with previous work(10), confirmed reduced protein levels in 82.5% of our cohort. We show that heterozygous pLoF variants in LRRK2 reduce LRRK2 protein levels but that these are not strongly associated with any specific phenotype or disease state. Our results demonstrate the value of large-scale genomic databases and phenotyping of human loss-of-function carriers for target validation in drug discovery.Peer reviewe
Genetic Determinants of Lipid Traits in Diverse Populations from the Population Architecture using Genomics and Epidemiology (PAGE) Study
For the past five years, genome-wide association studies (GWAS) have identified hundreds of common variants associated with human diseases and traits, including high-density lipoprotein cholesterol (HDL-C), low-density lipoprotein cholesterol (LDL-C), and triglyceride (TG) levels. Approximately 95 loci associated with lipid levels have been identified primarily among populations of European ancestry. The Population Architecture using Genomics and Epidemiology (PAGE) study was established in 2008 to characterize GWAS–identified variants in diverse population-based studies. We genotyped 49 GWAS–identified SNPs associated with one or more lipid traits in at least two PAGE studies and across six racial/ethnic groups. We performed a meta-analysis testing for SNP associations with fasting HDL-C, LDL-C, and ln(TG) levels in self-identified European American (∼20,000), African American (∼9,000), American Indian (∼6,000), Mexican American/Hispanic (∼2,500), Japanese/East Asian (∼690), and Pacific Islander/Native Hawaiian (∼175) adults, regardless of lipid-lowering medication use. We replicated 55 of 60 (92%) SNP associations tested in European Americans at p<0.05. Despite sufficient power, we were unable to replicate ABCA1 rs4149268 and rs1883025, CETP rs1864163, and TTC39B rs471364 previously associated with HDL-C and MAFB rs6102059 previously associated with LDL-C. Based on significance (p<0.05) and consistent direction of effect, a majority of replicated genotype-phentoype associations for HDL-C, LDL-C, and ln(TG) in European Americans generalized to African Americans (48%, 61%, and 57%), American Indians (45%, 64%, and 77%), and Mexican Americans/Hispanics (57%, 56%, and 86%). Overall, 16 associations generalized across all three populations. For the associations that did not generalize, differences in effect sizes, allele frequencies, and linkage disequilibrium offer clues to the next generation of association studies for these traits
Investigation of gene-by-sex interactions for lipid traits in diverse populations from the population architecture using genomics and epidemiology study
Abstract Background High-density lipoprotein cholesterol (HDL-C), low-density lipoprotein cholesterol (LDL-C), and triglyceride (TG) levels are influenced by both genes and the environment. Genome-wide association studies (GWAS) have identified ~100 common genetic variants associated with HDL-C, LDL-C, and/or TG levels, mostly in populations of European descent, but little is known about the modifiers of these associations. Here, we investigated whether GWAS-identified SNPs for lipid traits exhibited heterogeneity by sex in the Population Architecture using Genomics and Epidemiology (PAGE) study. Results A sex-stratified meta-analysis was performed for 49 GWAS-identified SNPs for fasting HDL-C, LDL-C, and ln(TG) levels among adults self-identified as European American (25,013). Heterogeneity by sex was established when phet < 0.001. There was evidence for heterogeneity by sex for two SNPs for ln(TG) in the APOA1/C3/A4/A5/BUD13 gene cluster: rs28927680 (phet = 7.4x10-7) and rs3135506 (phet = 4.3x10-4), one SNP in PLTP for HDL levels (rs7679; phet = 9.9x10-4), and one in HMGCR for LDL levels (rs12654264; phet = 3.1x10-5). We replicated heterogeneity by sex in five of seventeen loci previously reported by genome-wide studies (binomial p = 0.0009). We also present results for other racial/ethnic groups in the supplementary materials, to provide a resource for future meta-analyses. Conclusions We provide further evidence for sex-specific effects of SNPs in the APOA1/C3/A4/A5/BUD13 gene cluster, PLTP, and HMGCR on fasting triglyceride levels in European Americans from the PAGE study. Our findings emphasize the need for considering context-specific effects when interpreting genetic associations emerging from GWAS, and also highlight the difficulties in replicating interaction effects across studies and across racial/ethnic groups
No evidence of interaction between known lipid-associated genetic variants and smoking in the multi-ethnic PAGE population
Genome-wide association studies (GWAS) have identified many variants that influence high-density lipoprotein cholesterol, low-density lipoprotein cholesterol, and/or triglycerides. However, environmental modifiers, such as smoking, of these known genotype–phenotype associations are just recently emerging in the literature. We have tested for interactions between smoking and 49 GWAS-identified variants in over 41,000 racially/ethnically diverse samples with lipid levels from the Population Architecture Using Genomics and Epidemiology (PAGE) study. Despite their biological plausibility, we were unable to detect significant SNP × smoking interactions
New genetic loci link adipose and insulin biology to body fat distribution.
Body fat distribution is a heritable trait and a well-established predictor of adverse metabolic outcomes, independent of overall adiposity. To increase our understanding of the genetic basis of body fat distribution and its molecular links to cardiometabolic traits, here we conduct genome-wide association meta-analyses of traits related to waist and hip circumferences in up to 224,459 individuals. We identify 49 loci (33 new) associated with waist-to-hip ratio adjusted for body mass index (BMI), and an additional 19 loci newly associated with related waist and hip circumference measures (P < 5 × 10(-8)). In total, 20 of the 49 waist-to-hip ratio adjusted for BMI loci show significant sexual dimorphism, 19 of which display a stronger effect in women. The identified loci were enriched for genes expressed in adipose tissue and for putative regulatory elements in adipocytes. Pathway analyses implicated adipogenesis, angiogenesis, transcriptional regulation and insulin resistance as processes affecting fat distribution, providing insight into potential pathophysiological mechanisms
Association Between Chromosome 9p21 Variants and the Ankle-Brachial Index Identified by a Meta-Analysis of 21 Genome-Wide Association Studies
Genetic determinants of peripheral arterial disease (PAD) remain largely unknown. To identify genetic variants associated with the ankle-brachial index (ABI), a noninvasive measure of PAD, we conducted a meta-analysis of genome-wide association study data from 21 population-based cohorts
The Somatic Genomic Landscape of Glioblastoma
We describe the landscape of somatic genomic alterations based on multi-dimensional and comprehensive characterization of more than 500 glioblastoma tumors (GBMs). We identify several novel mutated genes as well as complex rearrangements of signature receptors including EGFR and PDGFRA. TERT promoter mutations are shown to correlate with elevated mRNA expression, supporting a role in telomerase reactivation. Correlative analyses confirm that the survival advantage of the proneural subtype is conferred by the G-CIMP phenotype, and MGMT DNA methylation may be a predictive biomarker for treatment response only in classical subtype GBM. Integrative analysis of genomic and proteomic profiles challenges the notion of therapeutic inhibition of a pathway as an alternative to inhibition of the target itself. These data will facilitate the discovery of therapeutic and diagnostic target candidates, the validation of research and clinical observations and the generation of unanticipated hypotheses that can advance our molecular understanding of this lethal cancer
- …