168 research outputs found

    DISSECTING PATHWAYS WITH THE YEAST KNOCKOUT COLLECTION

    Get PDF
    The yeast knockout collections provide opportunities to perform massively parallel phenotyping of deletion mutants for almost every yeast open reading frame. I used the knockout collection to screen for synthetic lethal partners, defined as alleles that cause lethality when combined but are nonlethal alone, with CTF4 and CTF18 and present the results in Chapter 2. I developed procedures for interpreting microarrays designed to compare changes in oligonucleotide TAGs specific to each knockout strain and present those methods in Chapters 3 and 4. These TAG microarrays allow thousands of experiments to screen for synthetic lethality among pairs of null alleles to be accomplished relatively quickly. In Chapter 4, I present 1410 novel predicted synthetic lethal interactions based on 707 currently completed screens. Interpretation of synthetic lethality is presented with a computational approach in Chapter 5, termed the congruence score. High congruence scores associate genes into common pathways, and I use the method to predict that YLL049W is a component of the dynein-dynactin nuclear orientation pathway. In Chapter 6, I propose a generalization of the congruence score to any phenotype, such as growth rate in the presence of various compounds, or even nonquantitative phenotypes such as cell morphology. This procedure connects genes based on similarity of multiple phenotypes using an application of information theory to produce a shared information score. Using gene ontology similarity, I show that high scores are associated with similarly annotated genes

    Commensurate distances and similar motifs in genetic congruence and protein interaction networks in yeast

    Get PDF
    BACKGROUND: In a genetic interaction, the phenotype of a double mutant differs from the combined phenotypes of the underlying single mutants. When the single mutants have no growth defect, but the double mutant is lethal or exhibits slow growth, the interaction is termed synthetic lethality or synthetic fitness. These genetic interactions reveal gene redundancy and compensating pathways. Recently available large-scale data sets of genetic interactions and protein interactions in Saccharomyces cerevisiae provide a unique opportunity to elucidate the topological structure of biological pathways and how genes function in these pathways. RESULTS: We have defined congruent genes as pairs of genes with similar sets of genetic interaction partners and constructed a genetic congruence network by linking congruent genes. By comparing path lengths in three types of networks (genetic interaction, genetic congruence, and protein interaction), we discovered that high genetic congruence not only exhibits correlation with direct protein interaction linkage but also exhibits commensurate distance with the protein interaction network. However, consistent distances were not observed between genetic and protein interaction networks. We also demonstrated that congruence and protein networks are enriched with motifs that indicate network transitivity, while the genetic network has both transitive (triangle) and intransitive (square) types of motifs. These results suggest that robustness of yeast cells to gene deletions is due in part to two complementary pathways (square motif) or three complementary pathways, any two of which are required for viability (triangle motif). CONCLUSION: Genetic congruence is superior to genetic interaction in prediction of protein interactions and function associations. Genetically interacting pairs usually belong to parallel compensatory pathways, which can generate transitive motifs (any two of three pathways needed) or intransitive motifs (either of two pathways needed)

    Improved microarray methods for profiling the yeast knockout strain collection

    Get PDF
    A remarkable feature of the Yeast Knockout strain collection is the presence of two unique 20mer TAG sequences in almost every strain. In principle, the relative abundances of strains in a complex mixture can be profiled swiftly and quantitatively by amplifying these sequences and hybridizing them to microarrays, but TAG microarrays have not been widely used. Here, we introduce a TAG microarray design with sophisticated controls and describe a robust method for hybridizing high concentrations of dye-labeled TAGs in single-stranded form. We also highlight the importance of avoiding PCR contamination and provide procedures for detection and eradication. Validation experiments using these methods yielded false positive (FP) and false negative (FN) rates for individual TAG detection of 3–6% and 15–18%, respectively. Analysis demonstrated that cross-hybridization was the chief source of FPs, while TAG amplification defects were the main cause of FNs. The materials, protocols, data and associated software described here comprise a suite of experimental resources that should facilitate the use of TAG microarrays for a wide variety of genetic screens

    Improved statistical analysis of budding yeast TAG microarrays revealed by defined spike-in pools

    Get PDF
    Saccharomyces cerevisiae knockout collection TAG microarrays are an emergent platform for rapid, genome-wide functional characterization of yeast genes. TAG arrays report abundance of unique oligonucleotide ‘TAG’ sequences incorporated into each deletion mutation of the yeast knockout collection, allowing measurement of relative strain representation across experimental conditions for all knockout mutants simultaneously. One application of TAG arrays is to perform genome-wide synthetic lethality screens, known as synthetic lethality analyzed by microarray (SLAM). We designed a fully defined spike-in pool to resemble typical SLAM experiments and performed TAG microarray hybridizations. We describe a method for analyzing two-color array data to efficiently measure the differential knockout strain representation across two experimental conditions, and use the spike-in pool to show that the sensitivity and specificity of this method exceed typical current approaches

    Gene function prediction from congruent synthetic lethal interactions in yeast

    Get PDF
    We predicted gene function using synthetic lethal genetic interactions between null alleles in Saccharomyces cerevisiae. Phenotypic and protein interaction data indicate that synthetic lethal gene pairs function in parallel or compensating pathways. Congruent gene pairs, defined as sharing synthetic lethal partners, are in single pathway branches. We predicted benomyl sensitivity and nuclear migration defects using congruence; these phenotypes were uncorrelated with direct synthetic lethality. We also predicted YLL049W as a new member of the dynein–dynactin pathway and provided new supporting experimental evidence. We performed synthetic lethal screens of the parallel mitotic exit network (MEN) and Cdc14 early anaphase release pathways required for late cell cycle. Synthetic lethal interactions bridged genes in these pathways, and high congruence linked genes within each pathway. Synthetic lethal interactions between MEN and all components of the Sin3/Rpd3 histone deacetylase revealed a novel function for Sin3/Rpd3 in promoting mitotic exit in parallel to MEN. These in silico methods can predict phenotypes and gene functions and are applicable to genomic synthetic lethality screens in yeast and analogous RNA interference screens in metazoans

    Powerful, Scalable and Resource-Efficient Meta-Analysis of Rare Variant Associations in Large Whole Genome Sequencing Studies

    Get PDF
    Meta-analysis of whole genome sequencing/whole exome sequencing (WGS/WES) studies provides an attractive solution to the problem of collecting large sample sizes for discovering rare variants associated with complex phenotypes. Existing rare variant meta-analysis approaches are not scalable to biobank-scale WGS data. Here we present MetaSTAAR, a powerful and resource-efficient rare variant meta-analysis framework for large-scale WGS/WES studies. MetaSTAAR accounts for relatedness and population structure, can analyze both quantitative and dichotomous traits and boosts the power of rare variant tests by incorporating multiple variant functional annotations. Through meta-analysis of four lipid traits in 30,138 ancestrally diverse samples from 14 studies of the Trans Omics for Precision Medicine (TOPMed) Program, we show that MetaSTAAR performs rare variant meta-analysis at scale and produces results comparable to using pooled data. Additionally, we identified several conditionally significant rare variant associations with lipid traits. We further demonstrate that MetaSTAAR is scalable to biobank-scale cohorts through meta-analysis of TOPMed WGS data and UK Biobank WES data of ~200,000 samples

    A Framework For Detecting Noncoding Rare-Variant associations of Large-Scale Whole-Genome Sequencing Studies

    Get PDF
    Large-scale whole-genome sequencing studies have enabled analysis of noncoding rare-variant (RV) associations with complex human diseases and traits. Variant-set analysis is a powerful approach to study RV association. However, existing methods have limited ability in analyzing the noncoding genome. We propose a computationally efficient and robust noncoding RV association detection framework, STAARpipeline, to automatically annotate a whole-genome sequencing study and perform flexible noncoding RV association analysis, including gene-centric analysis and fixed window-based and dynamic window-based non-gene-centric analysis by incorporating variant functional annotations. In gene-centric analysis, STAARpipeline uses STAAR to group noncoding variants based on functional categories of genes and incorporate multiple functional annotations. In non-gene-centric analysis, STAARpipeline uses SCANG-STAAR to incorporate dynamic window sizes and multiple functional annotations. We apply STAARpipeline to identify noncoding RV sets associated with four lipid traits in 21,015 discovery samples from the Trans-Omics for Precision Medicine (TOPMed) program and replicate several of them in an additional 9,123 toPMed samples. We also analyze five non-lipid toPMed traits

    Type 2 Diabetes Modifies the association of Cad Genomic Risk Variants With Subclinical atherosclerosis

    Get PDF
    BACKGROUND: Individuals with type 2 diabetes (T2D) have an increased risk of coronary artery disease (CAD), but questions remain about the underlying pathology. Identifying which CAD loci are modified by T2D in the development of subclinical atherosclerosis (coronary artery calcification [CAC], carotid intima-media thickness, or carotid plaque) may improve our understanding of the mechanisms leading to the increased CAD in T2D. METHODS: We compared the common and rare variant associations of known CAD loci from the literature on CAC, carotid intima-media thickness, and carotid plaque in up to 29 670 participants, including up to 24 157 normoglycemic controls and 5513 T2D cases leveraging whole-genome sequencing data from the Trans-Omics for Precision Medicine program. We included first-order T2D interaction terms in each model to determine whether CAD loci were modified by T2D. The genetic main and interaction effects were assessed using a joint test to determine whether a CAD variant, or gene-based rare variant set, was associated with the respective subclinical atherosclerosis measures and then further determined whether these loci had a significant interaction test. RESULTS: Using a Bonferroni-corrected significance threshold of CONCLUSIONS: These results highlight T2D as an important modifier of rare variant associations in CAD loci with CAC

    Validation of human telomere length multi-ancestry meta-analysis association signals identifies POP5 and KBTBD6 as human telomere length regulation genes

    Get PDF
    Genome-wide association studies (GWAS) have become well-powered to detect loci associated with telomere length. However, no prior work has validated genes nominated by GWAS to examine their role in telomere length regulation. We conducted a multi-ancestry meta-analysis of 211,369 individuals and identified five novel association signals. Enrichment analyses of chromatin state and cell-type heritability suggested that blood/immune cells are the most relevant cell type to examine telomere length association signals. We validated specific GWAS associations by overexpressing KBTBD6 or POP5 and demonstrated that both lengthened telomeres. CRISPR/Cas9 deletion of the predicted causal regions in K562 blood cells reduced expression of these genes, demonstrating that these loci are related to transcriptional regulation of KBTBD6 and POP5. Our results demonstrate the utility of telomere length GWAS in the identification of telomere length regulation mechanisms and validate KBTBD6 and POP5 as genes affecting telomere length regulation
    corecore