279 research outputs found

    Accurate estimation of SNP-heritability from biobank-scale data irrespective of genetic architecture.

    Get PDF
    SNP-heritability is a fundamental quantity in the study of complex traits. Recent studies have shown that existing methods to estimate genome-wide SNP-heritability can yield biases when their assumptions are violated. While various approaches have been proposed to account for frequency- and linkage disequilibrium (LD)-dependent genetic architectures, it remains unclear which estimates reported in the literature are reliable. Here we show that genome-wide SNP-heritability can be accurately estimated from biobank-scale data irrespective of genetic architecture, without specifying a heritability model or partitioning SNPs by allele frequency and/or LD. We show analytically and through extensive simulations starting from real genotypes (UK Biobank, N = 337 K) that, unlike existing methods, our closed-form estimator is robust across a wide range of architectures. We provide estimates of SNP-heritability for 22 complex traits in the UK Biobank and show that, consistent with our results in simulations, existing biobank-scale methods yield estimates up to 30% different from our theoretically-justified approach

    Algorithms for Viral Population Analysis

    Get PDF
    The genetic structure of an intra-host viral population has an effect on many clinically important phenotypic traits such as escape from vaccine induced immunity, virulence, and response to antiviral therapies. Next-generation sequencing provides read-coverage sufficient for genomic reconstruction of a heterogeneous, yet highly similar, viral population; and more specifically, for the detection of rare variants. Admittedly, while depth is less of an issue for modern sequencers, the short length of generated reads complicates viral population assembly. This task is worsened by the presence of both random and systematic sequencing errors in huge amounts of data. In this dissertation I present completed work for reconstructing a viral population given next-generation sequencing data. Several algorithms are described for solving this problem under the error-free amplicon (or sliding-window) model. In order for these methods to handle actual real-world data, an error-correction method is proposed. A formal derivation of its likelihood model along with optimization steps for an EM algorithm are presented. Although these methods perform well, they cannot take into account paired-end sequencing data. In order to address this, a new method is detailed that works under the error-free paired-end case along with maximum a-posteriori estimation of the model parameters

    Transcriptome-wide association study of schizophrenia and chromatin activity yields mechanistic disease insights

    Get PDF
    Genome-wide association studies (GWAS) have identified over 100 risk loci for schizophrenia, but the causal mechanisms remain largely unknown. We performed a transcriptome-wide association study (TWAS) integrating a schizophrenia GWAS of 79,845 individuals from the Psychiatric Genomics Consortium with expression data from brain, blood, and adipose tissues across 3,693 primarily control individuals. We identified 157 TWAS-significant genes, of which 35 did not overlap a known GWAS locus. Of these 157 genes, 42 were associated with specific chromatin features measured in independent samples, thus highlighting potential regulatory targets for follow-up. Suppression of one identified susceptibility gene, mapk3, in zebrafish showed a significant effect on neurodevelopmental phenotypes. Expression and splicing from the brain captured most of the TWAS effect across all genes. This large-scale connection of associations to target genes, tissues, and regulatory features is an essential step in moving toward a mechanistic understanding of GWAS

    Accurate viral population assembly from ultra-deep sequencing data

    Get PDF
    MotivationNext-generation sequencing technologies sequence viruses with ultra-deep coverage, thus promising to revolutionize our understanding of the underlying diversity of viral populations. While the sequencing coverage is high enough that even rare viral variants are sequenced, the presence of sequencing errors makes it difficult to distinguish between rare variants and sequencing errors.ResultsIn this article, we present a method to overcome the limitations of sequencing technologies and assemble a diverse viral population that allows for the detection of previously undiscovered rare variants. The proposed method consists of a high-fidelity sequencing protocol and an accurate viral population assembly method, referred to as Viral Genome Assembler (VGA). The proposed protocol is able to eliminate sequencing errors by using individual barcodes attached to the sequencing fragments. Highly accurate data in combination with deep coverage allow VGA to assemble rare variants. VGA uses an expectation-maximization algorithm to estimate abundances of the assembled viral variants in the population. RESULTS on both synthetic and real datasets show that our method is able to accurately assemble an HIV viral population and detect rare variants previously undetectable due to sequencing errors. VGA outperforms state-of-the-art methods for genome-wide viral assembly. Furthermore, our method is the first viral assembly method that scales to millions of sequencing reads.AvailabilityOur tool VGA is freely available at http://genetics.cs.ucla.edu/vga

    Twas_SIM, a Python-Based Tool for Simulation and Power Analysis of Transcriptome-Wide Association Analysis

    Get PDF
    Genome-wide association studies (GWASs) have identified numerous genetic variants associated with complex disease risk; however, most of these associations are non-coding, complicating identifying their proximal target gene. Transcriptome-wide association studies (TWASs) have been proposed to mitigate this gap by integrating expression quantitative trait loci (eQTL) data with GWAS data. Numerous methodological advancements have been made for TWAS, yet each approach requires ad hoc simulations to demonstrate feasibility. Here, we present twas_sim, a computationally scalable and easily extendable tool for simplified performance evaluation and power analysis for TWAS methods. Software and documentation are available at https://github.com/mancusolab/twas_sim

    Identification of TBX15 as an adipose master trans regulator of abdominal obesity genes

    Get PDF
    Background: Obesity predisposes individuals to multiple cardiometabolic disorders, including type 2 diabetes (T2D). As body mass index (BMI) cannot reliably differentiate fat from lean mass, the metabolically detrimental abdominal obesity has been estimated using waist-hip ratio (WHR). Waist-hip ratio adjusted for body mass index (WHRadjBMI) in turn is a well-established sex-specific marker for abdominal fat and adiposity, and a predictor of adverse metabolic outcomes, such as T2D. However, the underlying genes and regulatory mechanisms orchestrating the sex differences in obesity and body fat distribution in humans are not well understood. Methods: We searched for genetic master regulators of WHRadjBMI by employing integrative genomics approaches on human subcutaneous adipose RNA sequencing (RNA-seq) data (n similar to 1400) and WHRadjBMI GWAS data (n similar to 700,000) from the WHRadjBMI GWAS cohorts and the UK Biobank (UKB), using co-expression network, transcriptome-wide association study (TWAS), and polygenic risk score (PRS) approaches. Finally, we functionally verified our genomic results using gene knockdown experiments in a human primary cell type that is critical for adipose tissue function. Results: Here, we identified an adipose gene co-expression network that contains 35 obesity GWAS genes and explains a significant amount of polygenic risk for abdominal obesity and T2D in the UKB (n = 392,551) in a sex-dependent way. We showed that this network is preserved in the adipose tissue data from the Finnish Kuopio Obesity Study and Mexican Obesity Study. The network is controlled by a novel adipose master transcription factor (TF), TBX15, a WHRadjBMI GWAS gene that regulates the network in trans. Knockdown of TBX15 in human primary preadipocytes resulted in changes in expression of 130 network genes, including the key adipose TFs, PPARG and KLF15, which were significantly impacted (FDR < 0.05), thus functionally verifying the trans regulatory effect of TBX15 on the WHRadjBMI co-expression network. Conclusions: Our study discovers a novel key function for the TBX15 TF in trans regulating an adipose co-expression network of 347 adipose, mitochondrial, and metabolically important genes, including PPARG, KLF15, PPARA, ADIPOQ, and 35 obesity GWAS genes. Thus, based on our converging genomic, transcriptional, and functional evidence, we interpret the role of TBX15 to be a main transcriptional regulator in the adipose tissue and discover its importance in human abdominal obesity.Peer reviewe

    Application of Patients Reported Outcome measures in Cochlear Implant patients: implications for the design of specific rehabilitation programs

    Get PDF
    Introduction: Cochlear implants (CI) had been developed to enable a satisfying verbal communication, music experiences have remained in the background of research and development and consequently many CI users are dissatisfied by the music they listen to. Nonetheless, concise indications for clinicians to test music abilities and prescribe rehabilitation programs are still lacking. The main aim of the present study was to test the utility of the application of two different Patient Reporting Outcomes (PRO) measures in a group of CI users. A secondary objective was to identify items capable of driving the indication and design specific music rehabilitation programs for CI patients. Material and Methods: A consecutive series of 73 CI patients referred to the Audiology Unit - University of Padova - was enrolled from November 2021 to May 2022 and evaluated with audiological battery test and PRO measures: Musica e Qualità della Vita (MUSQUAV) and Nijmegen Cochlear Implant Questionnaire (NCIQ) Italian version. Results: The Reliability analysis showed good consistency between the different PRO measures, (Cronbach alpha = 0.873). After accounting for epidemiological and clinical variables, PRO measures showed a correlation with audiological outcomes in only one case (rho=-0.170 for NCIQ-T with CI-Pure Tone Average. A willingness for musical rehabilitation was present in 63% of patients, (Rehab Factor, mean value of 0.791±0.675). Conclusions: we support the role of the application of MUSQUAV and NCIQ to improve the clinical and audiological evaluation of CI patients. Moreover, we proposed a derivative item, called Rehab Factor, which could be used in clinical practice and future studies to clarify the indication and priority of specific music rehabilitation programs
    corecore