27 research outputs found

    GENCODE: reference annotation for the human and mouse genomes in 2023.

    Get PDF
    GENCODE produces high quality gene and transcript annotation for the human and mouse genomes. All GENCODE annotation is supported by experimental data and serves as a reference for genome biology and clinical genomics. The GENCODE consortium generates targeted experimental data, develops bioinformatic tools and carries out analyses that, along with externally produced data and methods, support the identification and annotation of transcript structures and the determination of their function. Here, we present an update on the annotation of human and mouse genes, including developments in the tools, data, analyses and major collaborations which underpin this progress. For example, we report the creation of a set of non-canonical ORFs identified in GENCODE transcripts, the LRGASP collaboration to assess the use of long transcriptomic data to build transcript models, the progress in collaborations with RefSeq and UniProt to increase convergence in the annotation of human and mouse protein-coding genes, the propagation of GENCODE across the human pan-genome and the development of new tools to support annotation of regulatory features by GENCODE. Our annotation is accessible via Ensembl, the UCSC Genome Browser and https://www.gencodegenes.org

    GENCODE reference annotation for the human and mouse genomes

    Get PDF
    The accurate identification and description of the genes in the human and mouse genomes is a fundamental requirement for high quality analysis of data informing both genome biology and clinical genomics. Over the last 15 years, the GENCODE consortium has been producing reference quality gene annotations to provide this foundational resource. The GENCODE consortium includes both experimental and computational biology groups who work together to improve and extend the GENCODE gene annotation. Specifically, we generate primary data, create bioinformatics tools and provide analysis to support the work of expert manual gene annotators and automated gene annotation pipelines. In addition, manual and computational annotation workflows use any and all publicly available data and analysis, along with the research literature to identify and characterise gene loci to the highest standard. GENCODE gene annotations are accessible via the Ensembl and UCSC Genome Browsers, the Ensembl FTP site, Ensembl Biomart, Ensembl Perl and REST APIs as well as https://www.gencodegenes.org.National Human Genome Research Institute of the National Institutes of Healt

    Multi-tissue integrative analysis of personal epigenomes

    Get PDF
    Evaluating the impact of genetic variants on transcriptional regulation is a central goal in biological science that has been constrained by reliance on a single reference genome. To address this, we constructed phased, diploid genomes for four cadaveric donors (using long-read sequencing) and systematically charted noncoding regulatory elements and transcriptional activity across more than 25 tissues from these donors. Integrative analysis revealed over a million variants with allele-specific activity, coordinated, locus-scale allelic imbalances, and structural variants impacting proximal chromatin structure. We relate the personal genome analysis to the ENCODE encyclopedia, annotating allele- and tissue-specific elements that are strongly enriched for variants impacting expression and disease phenotypes. These experimental and statistical approaches, and the corresponding EN-TEx resource, provide a framework for personalized functional genomics

    Embodied Perspectives on Sculpture: A Case Study in Interdisciplinary Research

    No full text
    The International Community for Movement (ICM) aims to connect and extend the field of actor movement by offering workshops, discussion and talks for professional movement practitioners and actors. The ICM acts as a gathering point for the development of movement teachers, movement directors and other movement practitioners working in theatre, actor training and allied fields. The MA/MFA Movement: Directing and Teaching course leaders have curated the ICM; the combined specialisms of Vanessa Ewan and Ayse Tashkiran of actor movement pedagogy and movement direction pedagogy inform the events of this community. In the pilot phase we have run a series of creative, teaching research projects and workshops that have shared practices, as well as initiated and responded to questions arising from movement direction and actor movement

    Determinants of sustained stabilization of beta-cell function following short-term insulin therapy in type 2 diabetes

    No full text
    Abstract In early type 2 diabetes, the strategy of “induction” with short-term intensive insulin therapy followed by “maintenance” with metformin can stabilize pancreatic beta-cell function in some patients but not others. We thus sought to elucidate determinants of sustained stabilization of beta-cell function. In this secondary analysis of ClinicalTrials.Gov NCT02192424, adults with ≀5-years diabetes duration were randomized to 3-weeks induction insulin therapy (glargine/lispro) followed by metformin maintenance either with or without intermittent 2-week courses of insulin every 3-months for 2-years. Sustained stabilization (higher beta-cell function at 2-years than at baseline) was achieved in 55 of 99 participants. Independent predictors of sustained stabilization were the change in beta-cell function during induction and changes in hepatic insulin resistance and alanine aminotransferase during maintenance. Thus, initial reversibility of beta-cell dysfunction during induction and subsequent preservation of hepatic insulin sensitivity during maintenance are associated with sustained stabilization of beta-cell function following short-term insulin and metformin. ClinicalTrials.Gov NCT0219242

    The Relationship between Intramuscular Adipose Tissue, Functional Mobility, and Strength in Postmenopausal Women with and without Type 2 Diabetes

    No full text
    Objectives. To determine (1) whether intramuscular adipose tissue (IntraMAT) differs between women with and without type 2 diabetes and (2) the association between IntraMAT and mobility and strength. Methods. 59 women ≄ 65 years with and without type 2 diabetes were included. A 1-Tesla MRI was used to acquire images of the leg. Timed-up-and-go (TUG) and grip strength were measured. Regression was used to determine associations between the following: (1) type 2 diabetes and IntraMAT (covariates: age, ethnicity, BMI, waist : hip ratio, and energy expenditure), (2) IntraMAT and TUG (covariates: diabetes, age, BMI, and energy expenditure), and (3) IntraMAT and grip strength (covariates: diabetes, age, height, and lean mass). Results. Women with diabetes had more IntraMAT. After adjustment, IntraMAT was similar between groups (diabetes mean [SD] = 13.2 [1.4]%, controls 11.8 [1.3]%, P=0.515). IntraMAT was related to TUG and grip strength, but the relationships became nonsignificant after adjustment for covariates (difference/percent IntraMAT [95% CI]: TUG = 0.041 seconds [−0.079–0.161], P=0.498, grip strength = −0.144 kg [−0.335–0.066], P=0.175). Conclusions. IntraMAT alone may not be a clinically important predictor of functional mobility and strength; however, whether losses in functional mobility and strength are promoted by IntraMAT accumulation should be explored

    Expectations and blind spots for structural variation detection from long-read assemblies and short-read genome sequencing technologies.

    No full text
    Virtually all genome sequencing efforts in national biobanks, complex and Mendelian disease programs, and medical genetic initiatives are reliant upon short-read whole-genome sequencing (srWGS), which presents challenges for the detection of structural variants (SVs) relative to emerging long-read WGS (lrWGS) technologies. Given this ubiquity of srWGS in large-scale genomics initiatives, we sought to establish expectations for routine SV detection from this data type by comparison with lrWGS assembly, as well as to quantify the genomic properties and added value of SVs uniquely accessible to each technology. Analyses from the Human Genome Structural Variation Consortium (HGSVC) of three families captured ~11,000 SVs per genome from srWGS and ~25,000 SVs per genome from lrWGS assembly. Detection power and precision for SV discovery varied dramatically by genomic context and variant class: 9.7% of the current GRCh38 reference is defined by segmental duplication (SD) and simple repeat (SR), yet 91.4% of deletions that were specifically discovered by lrWGS localized to these regions. Across the remaining 90.3% of reference sequence, we observed extremely high (93.8%) concordance between technologies for deletions in these datasets. In contrast, lrWGS was superior for detection of insertions across all genomic contexts. Given that non-SD/SR sequences encompass 95.9% of currently annotated disease-associated exons, improved sensitivity from lrWGS to discover novel pathogenic deletions in these currently interpretable genomic regions is likely to be incremental. However, these analyses highlight the considerable added value of assembly-based lrWGS to create new catalogs of insertions and transposable elements, as well as disease-associated repeat expansions in genomic sequences that were previously recalcitrant to routine assessment
    corecore