689 research outputs found

    Leveraging variational autoencoders for multiple data imputation

    Missing data persists as a major barrier to data analysis across numerous applications. Recently, deep generative models have been used for imputation of missing data, motivated by their ability to capture highly non-linear and complex relationships in the data. In this work, we investigate the ability of deep models, namely variational autoencoders (VAEs), to account for uncertainty in missing data through multiple imputation strategies. We find that VAEs provide poor empirical coverage of missing data, with underestimation and overconfident imputations, particularly for more extreme missing data values. To overcome this, we employ β-VAEs, which, viewed from a generalized Bayes framework, provide robustness to model misspecification. Assigning a good value of β is critical for uncertainty calibration, and we demonstrate how this can be achieved using cross-validation. In downstream tasks, we show how multiple imputation with β-VAEs can avoid false discoveries that arise as artefacts of imputation. Comment: 17 pages, 3 main figures, 6 supplementary figures

    Sumoylation of The Budding Yeast Kinetochore Protein Ndc10 is Required for Ndc10 Spindle Localization and Regulation of Anaphase Spindle Elongation

    Posttranslational modification by the ubiquitin-like protein SUMO (small ubiquitin-like modifier) is emerging as an important regulator in many cellular processes, including genome integrity. In this study, we show that the kinetochore proteins Ndc10, Bir1, Ndc80, and Cep3, which mediate the attachment of chromosomes to spindle microtubules, are sumoylated substrates in budding yeast. Furthermore, we show that Ndc10, Bir1, and Cep3 but not Ndc80 are desumoylated upon exposure to nocodazole, highlighting the possibility of distinct roles for sumoylation in modulating kinetochore protein function and of a potential link between the sumoylation of kinetochore proteins and mitotic checkpoint function. We find that lysine to arginine mutations that eliminate the sumoylation of Ndc10 cause chromosome instability, mislocalization of Ndc10 from the mitotic spindle, abnormal anaphase spindles, and a loss of Bir1 sumoylation. These data suggest that sumoylation of Ndc10 and other kinetochore proteins plays a critical role during the mitotic process

    G3, GENETICS, and the GSA: Two Journals, One Mission

    With the June launch of its open-access journal G3: Genes | Genomes | Genetics, the Genetics Society of America (GSA) now offers two peer-edited journals. The missions of G3 and GENETICS are fundamentally the same: to provide a forum for timely communication of the latest findings in genetics, selected by editors who are the authors' peers. But the scopes of the two journals are different. Why offer two journals?

    Proteasome Nuclear Activity Affects Chromosome Stability by Controlling the Turnover of Mms22, a Protein Important for DNA Repair

    To expand the known spectrum of genes that maintain genome stability, we screened a recently released collection of temperature sensitive (Ts) yeast mutants for a chromosome instability (CIN) phenotype. Proteasome subunit genes represented a major functional group, and subsequent analysis demonstrated an evolutionarily conserved role in CIN. Analysis of individual proteasome core and lid subunit mutations showed that the CIN phenotype at semi-permissive temperature is associated with failure of subunit localization to the nucleus. The resultant proteasome dysfunction affects chromosome stability by impairing the kinetics of double strand break (DSB) repair. We show that the DNA repair protein Mms22 is required for DSB repair, and recruited to chromatin in a ubiquitin-dependent manner as a result of DNA damage. Moreover, subsequent proteasome-mediated degradation of Mms22 is necessary and sufficient for cell cycle progression through the G2/M arrest induced by DNA damage. Our results demonstrate for the first time that a double strand break repair protein is a proteasome target, and thus link nuclear proteasomal activity and DSB repair

    Monopolin subunit Csm1 associates with MIND complex to establish monopolar attachment of sister kinetochores at meiosis I

    Sexually reproducing organisms halve their cellular ploidy during gametogenesis by undergoing a specialized form of cell division known as meiosis. During meiosis, a single round of DNA replication is followed by two rounds of nuclear division (referred to as meiosis I and II). While sister kinetochores bind to microtubules emanating from opposite spindle poles during mitosis, they bind to microtubules originating from the same spindle pole during meiosis I. This phenomenon is referred to as mono-orientation and is essential for setting up the reductional mode of chromosome segregation during meiosis I. In budding yeast, mono-orientation depends on a four-component protein complex referred to as monopolin, which consists of two nucleolar proteins, Csm1 and Lrs4, the meiosis-specific protein Mam1 of unknown function, and the casein kinase Hrr25. The monopolin complex binds to kinetochores during meiosis I and prevents bipolar attachments. Although monopolin associates with kinetochores during meiosis I, its binding site(s) on the kinetochore is not known and its mechanism of action has not been established. By carrying out an imaging-based screen, we have found that the MIND complex, a component of the central kinetochore, is required for monopolin association with kinetochores during meiosis. Furthermore, we demonstrate that interaction of monopolin subunit Csm1 with the N-terminal domain of MIND complex subunit Dsn1 is essential both for the association of monopolin with kinetochores and for monopolar attachment of sister kinetochores during meiosis I. As such, this provides the first functional evidence for a monopolin-binding site at the kinetochore

    A Re-Annotation of the Saccharomyces Cerevisiae Genome

    Discrepancies in gene and orphan number indicated by previous analyses suggest that S. cerevisiae would benefit from a consistent re-annotation. In this analysis three new genes are identified and 46 alterations to gene coordinates are described. 370 ORFs are defined as totally spurious ORFs which should be disregarded. At least a further 193 genes could be described as very hypothetical, based on a number of criteria. It was found that disparate genes with sequence overlaps over ten amino acids (especially at the N-terminus) are rare in both S. cerevisiae and Sz. pombe. A new S. cerevisiae gene number estimate with an upper limit of 5804 is proposed, but after the removal of very hypothetical genes and pseudogenes this is reduced to 5570. Although this is likely to be closer to the true upper limit, it is still predicted to be an overestimate of gene number. A complete list of revised gene coordinates is available from the Sanger Centre (S. cerevisiae reannotation: ftp://ftp/pub/yeast/SCreannotation)

    Discovery of an unconventional centromere in budding yeast redefines evolution of point centromeres

    Centromeres are the chromosomal regions promoting kinetochore assembly for chromosome segregation. In many eukaryotes, the centromere consists of up to mega base pairs of DNA. On such "regional centromeres," kinetochore assembly is mainly defined by epigenetic regulation [1]. By contrast, a clade of budding yeasts (Saccharomycetaceae) has a "point centromere" of 120-200 base pairs of DNA, on which kinetochore assembly is defined by the consensus DNA sequence [2, 3]. During evolution, budding yeasts acquired point centromeres, which replaced ancestral, regional centromeres [4]. All known point centromeres among different yeast species share common consensus DNA elements (CDEs) [5, 6], implying that they evolved only once and stayed essentially unchanged throughout evolution. Here, we identify a yeast centromere that challenges this view: that of the budding yeast Naumovozyma castellii is the first unconventional point centromere with unique CDEs. The N. castellii centromere CDEs are essential for centromere function but have different DNA sequences from CDEs in other point centromeres. Gene order analyses around N. castellii centromeres indicate their unique, and separate, evolutionary origin. Nevertheless, they are still bound by the ortholog of the CBF3 complex, which recognizes CDEs in other point centromeres. The new type of point centromere originated prior to the divergence between N. castellii and its close relative Naumovozyma dairenensis and disseminated to all N. castellii chromosomes through extensive genome rearrangement. Thus, contrary to the conventional view, point centromeres can undergo rapid evolutionary changes. These findings give new insights into the evolution of point centromeres

    A new approach for obtaining rapid uniformity in rice (Oryza sativa L.) via a 3x x 2x cross

    A triploid (2n = 3x = 36) rice plant was obtained by screening a twin seedling population in which each seed germinated to two or three sprouts that were then crossed with diploid plants. One diploid plant was chosen among the various F1 progenies and developed into an F2 population via self-pollination. Compared with the control variety Shanyou 63, this F2 population had a stable agronomical performance in field trials, as confirmed by the F-test. The stability of the F2 population was further substantiated by molecular analysis with simple sequence repeat markers. Specifically, of 160 markers assayed, 37 (covering all 12 chromosomes) were polymorphic between the parental lines. Testing the F1 hybrid individually with these markers showed that each PCR product had only a single band instead of two bands from each parent. The bands were identical to either maternal (23 markers) or paternal (eight markers) bands or distinct from both parents (six markers). The amplified bands of all 60 randomly selected F2 plants were uniform and identical to those of the F1 hybrid. These results suggest that the F1 plant is a non-segregating hybrid and that a stable F2 population was obtained. This novel system provides an efficient means for shortening the cycle of hybrid rice seed production

    Transcriptional plasticity through differential assembly of a multiprotein activation complex

    Cell adaptation to the environment often involves induction of complex gene expression programs under the control of specific transcriptional activators. For instance, in response to cadmium, budding yeast induces transcription of the sulfur amino acid biosynthetic genes through the basic-leucine zipper activator Met4, and also launches a program that replaces abundant glycolytic enzymes with isozymes of lower sulfur content. We demonstrate here that transcriptional induction of PDC6, which encodes a pyruvate decarboxylase isoform with low sulfur content, is directly controlled by Met4 and its DNA-binding cofactors, the basic-helix–loop–helix protein Cbf1 and the two homologous zinc finger proteins Met31 and Met32. Study of Cbf1 and Met31/32 association with PDC6 allowed us to find a new mechanism of Met4 recruitment, which allows PDC6 to be regulated differently from the sulfur amino acid biosynthetic genes. Our findings provide a new example of a mechanism allowing transcriptional plasticity within a regulatory network by means of a defined toolbox comprising a unique master activator and several dedicated DNA-binding cofactors. We also show evidence suggesting that integration of PDC6 into the Met4 regulon may have occurred recently in the evolution of the Saccharomyces cerevisiae lineage

    Fundamental issues in systems biology.

    Types: Journal Article; Research Support, Non-U.S. Gov't. In the context of scientists' reflections on genomics, we examine some fundamental issues in the emerging postgenomic discipline of systems biology. Systems biology is best understood as consisting of two streams. One, which we shall call 'pragmatic systems biology', emphasises large-scale molecular interactions; the other, which we shall refer to as 'systems-theoretic biology', emphasises system principles. Both are committed to mathematical modelling, and both lack a clear account of what biological systems are. We discuss the underlying issues in identifying systems and how causality operates at different levels of organisation. We suggest that resolving such basic problems is a key task for successful systems biology, and that philosophers could contribute to its realisation. We conclude with an argument for more sociologically informed collaboration between scientists and philosophers. Funding received from the Economic and Social Research Council (ESRC), UK, and Overseas Conference Funding from the British Academy