61 research outputs found

    Interspliced transcription chimeras: Neglected pathological mechanism infiltrating gene accession queries?

    Get PDF
    AbstractOver half of the DNA of mammalian genomes is transcribed, and one of the emerging enigmas in the field of RNA research is intergenic splicing or transcription induced chimerism. We argue that fused low-copy-number transcripts constitute neglected pathological mechanism akin to copy number variation, due to loss of stoichiometric subunit ratios in protein complexes. An obstacle for transcriptomics meta-analysis of published microarrays is the traditional nomenclature of merged transcript neighbors under same accession codes. Tandem transcripts cover 4–20% of genomes but are only loosely overlapping in population. They were most enriched in systems medicine annotations concerning neurology, thalassemia and genital disorders in the GeneGo Inc. MetaCore-MetaDrugTM knowledgebase, evaluated with external randomizations here. Clinical transcriptomics is good news since new disease etiologies offer new remedies. We identified homeotic HOX-transfactors centered around BMI-1, the Grb2 adaptor network, the kallikrein system, and thalassemia RNA surveillance as vulnerable hotspot chimeras. As a cure, RNA interference would require verification of chimerism from symptomatic tissue contra healthy control tissue from the same patient

    Meta-analysis fine-mapping is often miscalibrated at single-variant resolution

    Get PDF
    Funding Information: We acknowledge all the participants and researchers of the 23 biobanks that have contributed to the GBMI. Biobank-specific acknowledgments are included in the Data S3 . We thank H. Huang, A.R. Martin, B.M. Neale, Y. Okada, K. Tsuo, J.C. Ulirsch, Y. Wang, and all the members of Finucane and Daly labs for their helpful feedback. M.K. was supported by a Nakajima Foundation Fellowship and the Masason Foundation . H.K.F. was funded by NIH grant DP5 OD024582 . Publisher Copyright: © 2022 The Author(s)Meta-analysis is pervasively used to combine multiple genome-wide association studies (GWASs). Fine-mapping of meta-analysis studies is typically performed as in a single-cohort study. Here, we first demonstrate that heterogeneity (e.g., of sample size, phenotyping, imputation) hurts calibration of meta-analysis fine-mapping. We propose a summary statistics-based quality-control (QC) method, suspicious loci analysis of meta-analysis summary statistics (SLALOM), that identifies suspicious loci for meta-analysis fine-mapping by detecting outliers in association statistics. We validate SLALOM in simulations and the GWAS Catalog. Applying SLALOM to 14 meta-analyses from the Global Biobank Meta-analysis Initiative (GBMI), we find that 67% of loci show suspicious patterns that call into question fine-mapping accuracy. These predicted suspicious loci are significantly depleted for having nonsynonymous variants as lead variant (2.7×; Fisher's exact p = 7.3 × 10−4). We find limited evidence of fine-mapping improvement in the GBMI meta-analyses compared with individual biobanks. We urge extreme caution when interpreting fine-mapping results from meta-analysis of heterogeneous cohorts.Peer reviewe

    Leveraging global multi-ancestry meta-analysis in the study of idiopathic pulmonary fibrosis genetics

    Get PDF
    Publisher Copyright: © 2022 The Author(s)The research of rare and devastating orphan diseases, such as idiopathic pulmonary fibrosis (IPF) has been limited by the rarity of the disease itself. The prognosis is poor—the prevalence of IPF is only approximately four times the incidence, limiting the recruitment of patients to trials and studies of the underlying biology. Global biobanking efforts can dramatically alter the future of IPF research. We describe a large-scale meta-analysis of IPF, with 8,492 patients and 1,355,819 population controls from 13 biobanks around the globe. Finally, we combine this meta-analysis with the largest available meta-analysis of IPF, reaching 11,160 patients and 1,364,410 population controls. We identify seven novel genome-wide significant loci, only one of which would have been identified if the analysis had been limited to European ancestry individuals. We observe notable pleiotropy across IPF susceptibility and severe COVID-19 infection and note an unexplained sex-heterogeneity effect at the strongest IPF locus MUC5B.Peer reviewe

    An expanded analysis framework for multivariate GWAS connects inflammatory biomarkers to functional variants and disease

    Get PDF
    Multivariate methods are known to increase the statistical power to detect associations in the case of shared genetic basis between phenotypes. They have, however, lacked essential analytic tools to follow-up and understand the biology underlying these associations. We developed a novel computational workflow for multivariate GWAS follow-up analyses, including fine-mapping and identification of the subset of traits driving associations (driver traits). Many follow-up tools require univariate regression coefficients which are lacking from multivariate results. Our method overcomes this problem by using Canonical Correlation Analysis to turn each multivariate association into its optimal univariate Linear Combination Phenotype (LCP). This enables an LCP-GWAS, which in turn generates the statistics required for follow-up analyses. We implemented our method on 12 highly correlated inflammatory biomarkers in a Finnish population-based study. Altogether, we identified 11 associations, four of which (F5, ABO, C1orf140 and PDGFRB) were not detected by biomarker-specific analyses. Fine-mapping identified 19 signals within the 11 loci and driver trait analysis determined the traits contributing to the associations. A phenome-wide association study on the 19 representative variants from the signals in 176,899 individuals from the FinnGen study revealed 53 disease associations (p <1 x 10(-4)). Several reported pQTLs in the 11 loci provided orthogonal evidence for the biologically relevant functions of the representative variants. Our novel multivariate analysis workflow provides a powerful addition to standard univariate GWAS analyses by enabling multivariate GWAS follow-up and thus promoting the advancement of powerful multivariate methods in genomics.Peer reviewe

    Serine 62-Phosphorylated MYC Associates with Nuclear Lamins and Its Regulation by CIP2A Is Essential for Regenerative Proliferation

    Get PDF
    An understanding of the mechanisms determining MYC's transcriptional and proliferation-promoting activities in vivo could facilitate approaches for MYC targeting. However, post-translational mechanisms that control MYC function in vivo are poorly understood. Here, we demonstrate that MYC phosphorylation at serine 62 enhances MYC accumulation on Lamin A/C-associated nuclear structures and that the protein phosphatase 2A (PP2A) inhibitor protein CIP2A is required for this process. CIP2A is also critical for serum-induced MYC phosphorylation and for MYC-elicited proliferation induction in vitro. Complementary transgenic approaches and an intestinal regeneration model further demonstrated the in vivo importance of CIP2A and serine 62 phosphorylation for MYC activity upon DNA damage. However, targeting of CIP2A did not influence the normal function of intestinal crypt cells. These data underline the importance of nuclear organization in the regulation of MYC phosphorylation, leading to an in vivo demonstration of a strategy for inhibiting MYC activity without detrimental physiological effects.Peer reviewe

    Global Biobank Meta-analysis Initiative : Powering genetic discovery across human disease

    Get PDF
    Funding Information: The work of the contributing biobanks was supported by numerous grants from governmental and charitable bodies. Biobank-specific acknowledgments and more detailed acknowledgments are included in Data S2. Initiative management, S.B.C. J.C. N.J.C. M.J.D. E.E.K. A.R.M. B.M.N. Y.O. A.V.P. D.A.v.H. R.G.W. C.J.W. W.Z. and S.Z.; individual biobank analysis, A.B. Y.B. B.M.B. C.D.B. S.C. T.-T.C. K.C. S.M.D. M.D. G.H.d.B. Y.D. N.J.D. M.-J.F. Y.-C.A.F. S.F. V.L.F. L.G.F. E.R.G. T.R.G. D.H.G. C.R.G. G.G.-A. S.E.G. L.A.G. C.H. J.B.H. W.E.H. H.H. K.H. N.I. A.I. R.J. M. Kurki, J.K. N.K. E.E.K. J.T.K. M. Kanai, T.L. K.L. M.H.L. S.L. K.L. Y.-F.L. V.L.F. R.J.F.L. E.A.L.-M. A.R.-M. S.M.-G. R.M. R.E.M. H.C.M. A.R.M. Y.M. H.M. S.E.M. I.Y.M. B.M. S.M. K.N. S.N. M.A.N.-A. K.N. Y.O. P.P. A.L.-P. A.P. B.P. S.P. M.H.P. D.J.R. N.R. M.D.R. A.R. C.S. S.S. S.S.S. J.A.S. P.S. I.S. T.T. R.T. K.T. J.U. D.A.v.H. B.V. M.V. Y.V. J.M.V. R.G.W. Y.W. S.J.W. B.N.W. K.-H.H.W. M.Z. X.Z. and S.Z.; individual biobank management, N.A. A.A.T. K.M.A.-D. P.A. K.C.B. M. Boehnke, M. Boezen, C.D.B. A.C. Z.C. C.-Y.C. J.C. N.J.C. S.M.D. S.F. Y.-C.A.F. S.F. E.F. T.G. C.R.G. C.J.G. Y.G. H.H. K.A.H. K.H. S.I.I. N.M.J. N.K. E.E.K. J.T.K. C.L. M.H.L. M.T.M.L. L.L. K.L. Y.-F.L. R.J.F.L. J.L. S.M. Y.M. K.M. I.Y.M. Y.O. C.M.O. A.V.P. B.P. D.J.P. D.J.R. M.D.R. S.S. J.W.S. H.S. K.S. T.T. U.T. R.C.T. D.A.v.H. M.V. R.G.W. D.C.W. C.W. J.W. M.Z. X.Z. and S.Z.; study design and interpretation of results, A.B. M. Boehnke, M. Boezen, B.M.B. T.-T.C. C.-Y.C. M.J.D. G.D.S. N.J.D. S.F. M.-J.F. H.K.F. E.R.G. A.G. T.G. J.B.H. J.H. K.H. R.J. M.K. E.E.K. T.K. C.M.L. V.L.F. E.A.L.-M. A.R.M. S.N. B.M.N. C.M.O. J.J.P. B.P. N.R. H.R. J.A.S. I.S. K.T. D.A.v.H. R.G.W. Y.W. D.C.W. S.J.W. C.J.W. B.N.W. J.W. K.-H.H.W. M.Z. H.Z. J.Z. W.Z. X.Z. and S.Z.; drafted and edited the paper, A.B. M. Boehnke, M. Boezen, M.J.D. G.H.d.B. N.J.D. T.R.G. J.B.H. N.I. N.M.J. M.K. V.L.F. S.M. A.R.M. H.M. S.N. B.M.N. C.M.O. B.P. H.R. C.S. J.A.S. J.W.S. K.T. Y.W. D.C.W. C.J.W. K.-H.H.W. H.Z. J.Z. W.Z. and S.Z.; primary meta-analysis and quality control, M.J.D. H.K.F. M. Kanai, J.K. J.T.K. M. Kurki, M.M. B.M.N. C.J.W. K.-H.H.W. and W.Z.; drug discovery: S.N. T.K. K.-H.H.W. W.Z. and Y.O.; fine mapping, M. Kanai, W.Z. M.J.D. and H.K.F.; polygenic risk score, Y.W. S.N. E.A.L.-M. S.K. K.T. K.L. M. Kanai, W.Z. K.W. M.-J.F. L.B. P.A. P.D. V.L.F. R.M. Y.M. B.B. S.S. J.U. E.R.G. N.J.C. I.S. Y.O. A.R.M. and J.B.H.; proteome-wide Mendelian randomization, H.Z. H.R. A.B. G.H. G.D.S. B.M.B. W.Z. B.M.N. T.R.G. and J.Z.; transcriptome-wide association study, A.B. J.B.H. W.Z. J.Z. M. Kanai, B.P. E.R.G. and N.J.C.; asthma, K.T. W.Z. Y.W. M. Kanai, S.N. Y.O. B.M.N. M.J.D. and A.R.M.; heart failure, K.-H.H.W. N.J.D. B.N.W. I.S. S.E.G. J.B.H. N.J.C. M.P. R.J.F.L. M.J.D. B.M.N. W.Z. W.E.H. and C.J.W.; idiopathic pulmonary fibrosis, J.J.P. W.Z. M.J.D. J.T.K. N.J.C. and J.B.H.; primary open-angle glaucoma, V.L.F. A.B. W.Z. Y.W. K.L. M. Kanai, E.A.L.-M. P.S. R.T. X.Z. S.N. S.S. Y.O. N.I. S.M. H.S. I.S. C.W. A.R.M. E.R.G. N.M.J. N.J.C. and J.B.H.; stroke, I.S. K.-H.H.W. W.H. B.N.W. W.Z. J.E.H. A.P. B.B. A.H.S. M.E.G. R.G.W. K.H. C.K. S.Z. M.J.D. B.M.N. and C.J.W.; venous thromboembolism, B.N.W. I.S. K.-H.H.W. B.B. V.L.F. K.T. M.D. B.N. W.Z. J.A.S. and C.J.W. All authors reviewed the manuscript. M.J.D. is a founder of Maze Therapeutics. B.M.N. is a member of the scientific advisory board at Deep Genomics and a consultant for Camp4 Therapeutics, Takeda Pharmaceutical, and Biogen. The spouse of C.J.W. works at Regeneron Pharmaceuticals. C.-Y.C. is employed by Biogen. C.R.G. owns stock in 23andMe, Inc. T.R.G. has received research funding from various pharmaceutical companies to support the application of Mendelian randomization to drug target prioritization. E.E.K. has received speaker fees from Regeneron, Illumina, and 23andMe and is a member of the advisory board for Galateo Bio. R.E.M. has received speaker fees from Illumina and is a scientific advisor to the Epigenetic Clock Development Foundation. G.D.S. has received research funding from various pharmaceutical companies to support the application of Mendelian randomization to drug target prioritization. K.S. and U.T. are employed by deCODE Genetics/Amgen, Inc. J.Z. has received research funding from various pharmaceutical companies to support the application of Mendelian randomization to drug target prioritization. S.M. is a co-founder of and holds stock in Seonix Bio. Publisher Copyright: © 2022Biobanks facilitate genome-wide association studies (GWASs), which have mapped genomic loci across a range of human diseases and traits. However, most biobanks are primarily composed of individuals of European ancestry. We introduce the Global Biobank Meta-analysis Initiative (GBMI)—a collaborative network of 23 biobanks from 4 continents representing more than 2.2 million consented individuals with genetic data linked to electronic health records. GBMI meta-analyzes summary statistics from GWASs generated using harmonized genotypes and phenotypes from member biobanks for 14 exemplar diseases and endpoints. This strategy validates that GWASs conducted in diverse biobanks can be integrated despite heterogeneity in case definitions, recruitment strategies, and baseline characteristics. This collaborative effort improves GWAS power for diseases, benefits understudied diseases, and improves risk prediction while also enabling the nomination of disease genes and drug candidates by incorporating gene and protein expression data and providing insight into the underlying biology of human diseases and traits.Peer reviewe

    Secretor Genotype (FUT2 gene) Is Strongly Associated with the Composition of Bifidobacteria in the Human Intestine

    Get PDF
    Intestinal microbiota plays an important role in human health, and its composition is determined by several factors, such as diet and host genotype. However, thus far it has remained unknown which host genes are determinants for the microbiota composition. We studied the diversity and abundance of dominant bacteria and bifidobacteria from the faecal samples of 71 healthy individuals. In this cohort, 14 were non-secretor individuals and the remainders were secretors. The secretor status is defined by the expression of the ABH and Lewis histo-blood group antigens in the intestinal mucus and other secretions. It is determined by fucosyltransferase 2 enzyme, encoded by the FUT2 gene. Non-functional enzyme resulting from a nonsense mutation in the FUT2 gene leads to the non-secretor phenotype. PCR-DGGE and qPCR methods were applied for the intestinal microbiota analysis. Principal component analysis of bifidobacterial DGGE profiles showed that the samples of non-secretor individuals formed a separate cluster within the secretor samples. Moreover, bifidobacterial diversity (p<0.0001), richness (p<0.0003), and abundance (p<0.05) were significantly reduced in the samples from the non-secretor individuals as compared with those from the secretor individuals. The non-secretor individuals lacked, or were rarely colonized by, several genotypes related to B. bifidum, B. adolescentis and B. catenulatum/pseudocatenulatum. In contrast to bifidobacteria, several bacterial genotypes were more common and the richness (p<0.04) of dominant bacteria as detected by PCR-DGGE was higher in the non-secretor individuals than in the secretor individuals. We showed that the diversity and composition of the human bifidobacterial population is strongly associated with the histo-blood group ABH secretor/non-secretor status, which consequently appears to be one of the host genetic determinants for the composition of the intestinal microbiota. This association can be explained by the difference between the secretor and non-secretor individuals in their expression of ABH and Lewis glycan epitopes in the mucosa

    Leveraging global multi-ancestry meta-analysis in the study of idiopathic pulmonary fibrosis genetics

    Get PDF
    The research of rare and devastating orphan diseases, such as idiopathic pulmonary fibrosis (IPF) has been limited by the rarity of the disease itself. The prognosis is poor—the prevalence of IPF is only approximately four times the incidence, limiting the recruitment of patients to trials and studies of the underlying biology. Global biobanking efforts can dramatically alter the future of IPF research. We describe a large-scale meta-analysis of IPF, with 8,492 patients and 1,355,819 population controls from 13 biobanks around the globe. Finally, we combine this meta-analysis with the largest available meta-analysis of IPF, reaching 11,160 patients and 1,364,410 population controls. We identify seven novel genome-wide significant loci, only one of which would have been identified if the analysis had been limited to European ancestry individuals. We observe notable pleiotropy across IPF susceptibility and severe COVID-19 infection and note an unexplained sex-heterogeneity effect at the strongest IPF locus MUC5B.publishedVersionPeer reviewe
    • 

    corecore