126 research outputs found

    Software and methods for oligonucleotide and cDNA array data analysis.

    Get PDF
    Two HTML-based programs were developed to analyze and filter gene-expression data: 'Bullfrog' for Affymetrix oligonucleotide arrays and 'Spot' for custom cDNA arrays. The programs provide intuitive data-filtering tools through an easy-to-use interface. A background subtraction and normalization program for cDNA arrays was also built that provides an informative summary report with data-quality assessments. These programs are freeware to aid in the analysis of gene-expression results and facilitate the search for genes responsible for interesting biological processes and phenotypes

    Genetic dissection of the pluripotent proteome through multi-omics data integration.

    Get PDF
    Genetic background drives phenotypic variability in pluripotent stem cells (PSCs). Most studies to date have used transcript abundance as the primary molecular readout of cell state in PSCs. We performed a comprehensive proteogenomics analysis of 190 genetically diverse mouse embryonic stem cell (mESC) lines. The quantitative proteome is highly variable across lines, and we identified pluripotency-associated pathways that were differentially activated in the proteomics data that were not evident in transcriptome data from the same lines. Integration of protein abundance to transcript levels and chromatin accessibility revealed broad co-variation across molecular layers as well as shared and unique drivers of quantitative variation in pluripotency-associated pathways. Quantitative trait locus (QTL) mapping localized the drivers of these multi-omic signatures to genomic hotspots. This study reveals post-transcriptional mechanisms and genetic interactions that underlie quantitative variability in the pluripotent proteome and provides a regulatory map for mESCs that can provide a basis for future mechanistic studies

    A unified framework for multi-locus association analysis of both common and rare variants

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Common, complex diseases are hypothesized to result from a combination of common and rare genetic variants. We developed a unified framework for the joint association testing of both types of variants. Within the framework, we developed a union-intersection test suitable for genome-wide analysis of single nucleotide polymorphisms (SNPs), candidate gene data, as well as medical sequencing data. The union-intersection test is a composite test of association of genotype frequencies and differential correlation among markers.</p> <p>Results</p> <p>We demonstrated by computer simulation that the false positive error rate was controlled at the expected level. We also demonstrated scenarios in which the multi-locus test was more powerful than traditional single marker analysis. To illustrate use of the union-intersection test with real data, we analyzed a publically available data set of 319,813 autosomal SNPs genotyped for 938 cases of Parkinson disease and 863 neurologically normal controls for which no genome-wide significant results were found by traditional single marker analysis. We also analyzed an independent follow-up sample of 183 cases and 248 controls for replication.</p> <p>Conclusions</p> <p>We identified a single risk haplotype with a directionally consistent effect in both samples in the gene <it>GAK</it>, which is involved in clathrin-mediated membrane trafficking. We also found suggestive evidence that directionally inconsistent marginal effects from single marker analysis appeared to result from risk being driven by different haplotypes in the two samples for the genes <it>SYN3 </it>and <it>NGLY1</it>, which are involved in neurotransmitter release and proteasomal degradation, respectively. These results illustrate the utility of our unified framework for genome-wide association analysis of common, complex diseases.</p

    Understanding the complex genetic architecture connecting rheumatoid arthritis, osteoporosis and inflammation:discovering causal pathways

    Get PDF
    Rheumatoid arthritis (RA) and osteoporosis (OP) are two comorbid complex inflammatory conditions with evidence of shared genetic background and causal relationships. We aimed to clarify the genetic architecture underlying RA and various OP phenotypes while additionally considering an inflammatory component, C-reactive protein (CRP). Genome-wide association study summary statistics were acquired from the GEnetic Factors for OSteoporosis Consortium, Cohorts for Heart and Aging Research Consortium and UK Biobank. Mendelian randomization (MR) was used to detect the presence of causal relationships. Colocalization analysis was performed to determine shared genetic variants between CRP and OP phenotypes. Analysis of pleiotropy between traits owing to shared causal single nucleotide polymorphisms (SNPs) was performed using PL eiotropic A nalysis under CO mposite null hypothesis (PLACO). MR analysis was suggestive of horizontal pleiotropy between RA and OP traits. RA was a significant causal risk factor for CRP (β = 0.027, 95% confidence interval = 0.016-0.038). There was no evidence of CRP→OP causal relationship, but horizontal pleiotropy was apparent. Colocalization established shared genomic regions between CRP and OP, including GCKR and SERPINA1 genes. Pleiotropy arising from shared causal SNPs revealed through the colocalization analysis was all confirmed by PLACO. These genes were found to be involved in the same molecular function 'protein binding' (GO:0005515) associated with RA, OP and CRP. We identified three major components explaining the epidemiological relationship among RA, OP and inflammation: (1) Pleiotropy explains a portion of the shared genetic relationship between RA and OP, albeit polygenically; (2) RA contributes to CRP elevation and (3) CRP, which is influenced by RA, demonstrated pleiotropy with OP.</p

    Association Between Whole Blood-Derived Mitochondrial Dna Copy Number, Low-Density Lipoprotein Cholesterol, and Cardiovascular Disease Risk

    Get PDF
    Background The relationship between mitochondrial DNA copy number (mtDNA CN) and cardiovascular disease remains elusive. Methods and Results We performed cross-sectional and prospective association analyses of blood-derived mtDNA CN and cardiovascular disease outcomes in 27 316 participants in 8 cohorts of multiple racial and ethnic groups with whole-genome sequencing. We also performed Mendelian randomization to explore causal relationships of mtDNA CN with coronary heart disease (CHD) and cardiometabolic risk factors (obesity, diabetes, hypertension, and hyperlipidemia)

    Mammographic density and risk of breast cancer by age and tumor characteristics

    Get PDF
    Introduction: Understanding whether mammographic density (MD) is associated with all breast tumor subtypes and whether the strength of association varies by age is important for utilizing MD in risk models. Methods: Data were pooled from six studies including 3414 women with breast cancer and 7199 without who underwent screening mammography. Percent MD was assessed from digitized film-screen mammograms using a computer-assisted threshold technique. We used polytomous logistic regression to calculate breast cancer odds according to tumor type, histopathological characteristics, and receptor (estrogen receptor (ER), progesterone receptor (PR), human epidermal growth factor receptor (HER2)) status by age (51%) versus average density (11-25%). Women ages 2.1 cm) versus small tumors and positive versus negative lymph node status (P’s < 0.01). For women ages <55 years, there was a stronger association of MD with ER-negative breast cancer than ER-positive tumors compared to women ages 55–64 and ≥65 years (Page-interaction = 0.04). MD was positively associated with both HER2-negative and HER2-positive tumors within each age group. Conclusion: MD is strongly associated with all breast cancer subtypes, but particularly tumors of large size and positive lymph nodes across all ages, and ER-negative status among women ages <55 years, suggesting high MD may play an important role in tumor aggressiveness, especially in younger women

    Mosaic Chromosomal alterations in Blood across ancestries Using Whole-Genome Sequencing

    Get PDF
    Megabase-scale mosaic chromosomal alterations (mCAs) in blood are prognostic markers for a host of human diseases. Here, to gain a better understanding of mCA rates in genetically diverse populations, we analyzed whole-genome sequencing data from 67,390 individuals from the National Heart, Lung, and Blood Institute Trans-Omics for Precision Medicine program. We observed higher sensitivity with whole-genome sequencing data, compared with array-based data, in uncovering mCAs at low mutant cell fractions and found that individuals of European ancestry have the highest rates of autosomal mCAs and the lowest rates of chromosome X mCAs, compared with individuals of African or Hispanic ancestry. Although further studies in diverse populations will be needed to replicate our findings, we report three loci associated with loss of chromosome X, associations between autosomal mCAs and rare variants in DCPS, ADM17, PPP1R16B and TET2 and ancestry-specific variants in ATM and MPL with mCAs in cis

    Genetic Signatures of Exceptional Longevity in Humans

    Get PDF
    Like most complex phenotypes, exceptional longevity is thought to reflect a combined influence of environmental (e.g., lifestyle choices, where we live) and genetic factors. To explore the genetic contribution, we undertook a genome-wide association study of exceptional longevity in 801 centenarians (median age at death 104 years) and 914 genetically matched healthy controls. Using these data, we built a genetic model that includes 281 single nucleotide polymorphisms (SNPs) and discriminated between cases and controls of the discovery set with 89% sensitivity and specificity, and with 58% specificity and 60% sensitivity in an independent cohort of 341 controls and 253 genetically matched nonagenarians and centenarians (median age 100 years). Consistent with the hypothesis that the genetic contribution is largest with the oldest ages, the sensitivity of the model increased in the independent cohort with older and older ages (71% to classify subjects with an age at death>102 and 85% to classify subjects with an age at death>105). For further validation, we applied the model to an additional, unmatched 60 centenarians (median age 107 years) resulting in 78% sensitivity, and 2863 unmatched controls with 61% specificity. The 281 SNPs include the SNP rs2075650 in TOMM40/APOE that reached irrefutable genome wide significance (posterior probability of association = 1) and replicated in the independent cohort. Removal of this SNP from the model reduced the accuracy by only 1%. Further in-silico analysis suggests that 90% of centenarians can be grouped into clusters characterized by different “genetic signatures” of varying predictive values for exceptional longevity. The correlation between 3 signatures and 3 different life spans was replicated in the combined replication sets. The different signatures may help dissect this complex phenotype into sub-phenotypes of exceptional longevity

    Genetic determinants of telomere length from 109,122 ancestrally diverse whole-genome sequences in TOPMed

    Get PDF
    Genetic studies on telomere length are important for understanding age-related diseases. Prior GWAS for leukocyte TL have been limited to European and Asian populations. Here, we report the first sequencing-based association study for TL across ancestrally-diverse individuals (European, African, Asian and Hispanic/Latino) from the NHLBI Trans-Omics for Precision Medicine (TOPMed) program. We used whole genome sequencing (WGS) of whole blood for variant genotype calling and the bioinformatic estimation of telomere length in n=109,122 individuals. We identified 59 sentinel variants (p-value OBFC1indicated the independent signals colocalized with cell-type specific eQTLs for OBFC1 (STN1). Using a multi-variant gene-based approach, we identified two genes newly implicated in telomere length, DCLRE1B (SNM1B) and PARN. In PheWAS, we demonstrated our TL polygenic trait scores (PTS) were associated with increased risk of cancer-related phenotypes

    Associations of common breast cancer susceptibility alleles with risk of breast cancer subtypes in BRCA1 and BRCA2 mutation carriers

    Get PDF
    Introduction: More than 70 common alleles are known to be involved in breast cancer (BC) susceptibility, and several exhibit significant heterogeneity in their associations with different BC subtypes. Although there are differences in the association patterns between BRCA1 and BRCA2 mutation carriers and the general population for several loci, no study has comprehensively evaluated the associations of all known BC susceptibility alleles with risk of BC subtypes in BRCA1 and BRCA2 carriers. Methods: We used data from 15,252 BRCA1 and 8,211 BRCA2 carriers to analyze the associations between approximately 200,000 genetic variants on the iCOGS array and risk of BC subtypes defined by estrogen receptor (ER), progesterone receptor (PR), human epidermal growth factor receptor 2 (HER2) and triple-negative- (TN) status; morphologic subtypes; histological grade; and nodal involvement. Results: The estimated BC hazard ratios (HRs) for the 74 known BC alleles in BRCA1 carriers exhibited moderate correlations with the corresponding odds ratios from the general population. However, their associations with ER-positive BC in BRCA1 carriers were more consistent with the ER-positive as
    corecore