31 research outputs found

    GENCODE: reference annotation for the human and mouse genomes in 2023.

    Get PDF
    GENCODE produces high quality gene and transcript annotation for the human and mouse genomes. All GENCODE annotation is supported by experimental data and serves as a reference for genome biology and clinical genomics. The GENCODE consortium generates targeted experimental data, develops bioinformatic tools and carries out analyses that, along with externally produced data and methods, support the identification and annotation of transcript structures and the determination of their function. Here, we present an update on the annotation of human and mouse genes, including developments in the tools, data, analyses and major collaborations which underpin this progress. For example, we report the creation of a set of non-canonical ORFs identified in GENCODE transcripts, the LRGASP collaboration to assess the use of long transcriptomic data to build transcript models, the progress in collaborations with RefSeq and UniProt to increase convergence in the annotation of human and mouse protein-coding genes, the propagation of GENCODE across the human pan-genome and the development of new tools to support annotation of regulatory features by GENCODE. Our annotation is accessible via Ensembl, the UCSC Genome Browser and https://www.gencodegenes.org

    Ensembl’s 10th year

    Get PDF
    Ensembl (http://www.ensembl.org) integrates genomic information for a comprehensive set of chordate genomes with a particular focus on resources for human, mouse, rat, zebrafish and other high-value sequenced genomes. We provide complete gene annotations for all supported species in addition to specific resources that target genome variation, function and evolution. Ensembl data is accessible in a variety of formats including via our genome browser, API and BioMart. This year marks the tenth anniversary of Ensembl and in that time the project has grown with advances in genome technology. As of release 56 (September 2009), Ensembl supports 51 species including marmoset, pig, zebra finch, lizard, gorilla and wallaby, which were added in the past year. Major additions and improvements to Ensembl since our previous report include the incorporation of the human GRCh37 assembly, enhanced visualisation and data-mining options for the Ensembl regulatory features and continued development of our software infrastructure

    Sequencing of the Sea Lamprey (Petromyzon marinus) Genome Provides Insights into Vertebrate Evolution

    Get PDF
    Lampreys are representatives of an ancient vertebrate lineage that diverged from our own ∼500 million years ago. By virtue of this deeply shared ancestry, the sea lamprey (P. marinus) genome is uniquely poised to provide insight into the ancestry of vertebrate genomes and the underlying principles of vertebrate biology. Here, we present the first lamprey whole-genome sequence and assembly. We note challenges faced owing to its high content of repetitive elements and GC bases, as well as the absence of broad-scale sequence information from closely related species. Analyses of the assembly indicate that two whole-genome duplications likely occurred before the divergence of ancestral lamprey and gnathostome lineages. Moreover, the results help define key evolutionary events within vertebrate lineages, including the origin of myelin-associated proteins and the development of appendages. The lamprey genome provides an important resource for reconstructing vertebrate origins and the evolutionary events that have shaped the genomes of extant organisms

    Multi-Platform Next-Generation Sequencing of the Domestic Turkey (Meleagris gallopavo): Genome Assembly and Analysis

    Get PDF
    The combined application of next-generation sequencing platforms has provided an economical approach to unlocking the potential of the turkey genome

    Genetic effects on gene expression across human tissues

    Get PDF
    Characterization of the molecular function of the human genome and its variation across individuals is essential for identifying the cellular mechanisms that underlie human genetic traits and diseases. The Genotype-Tissue Expression (GTEx) project aims to characterize variation in gene expression levels across individuals and diverse tissues of the human body, many of which are not easily accessible. Here we describe genetic effects on gene expression levels across 44 human tissues. We find that local genetic variation affects gene expression levels for the majority of genes, and we further identify inter-chromosomal genetic effects for 93 genes and 112 loci. On the basis of the identified genetic effects, we characterize patterns of tissue specificity, compare local and distal effects, and evaluate the functional properties of the genetic effects. We also demonstrate that multi-tissue, multi-individual data can be used to identify genes and pathways affected by human disease-associated variation, enabling a mechanistic interpretation of gene regulation and the genetic basis of diseas

    GENCODE reference annotation for the human and mouse genomes

    Get PDF
    The accurate identification and description of the genes in the human and mouse genomes is a fundamental requirement for high quality analysis of data informing both genome biology and clinical genomics. Over the last 15 years, the GENCODE consortium has been producing reference quality gene annotations to provide this foundational resource. The GENCODE consortium includes both experimental and computational biology groups who work together to improve and extend the GENCODE gene annotation. Specifically, we generate primary data, create bioinformatics tools and provide analysis to support the work of expert manual gene annotators and automated gene annotation pipelines. In addition, manual and computational annotation workflows use any and all publicly available data and analysis, along with the research literature to identify and characterise gene loci to the highest standard. GENCODE gene annotations are accessible via the Ensembl and UCSC Genome Browsers, the Ensembl FTP site, Ensembl Biomart, Ensembl Perl and REST APIs as well as https://www.gencodegenes.org.National Human Genome Research Institute of the National Institutes of Healt

    Genetic effects on gene expression across human tissues

    Get PDF
    Characterization of the molecular function of the human genome and its variation across individuals is essential for identifying the cellular mechanisms that underlie human genetic traits and diseases. The Genotype-Tissue Expression (GTEx) project aims to characterize variation in gene expression levels across individuals and diverse tissues of the human body, many of which are not easily accessible. Here we describe genetic effects on gene expression levels across 44 human tissues. We find that local genetic variation affects gene expression levels for the majority of genes, and we further identify inter-chromosomal genetic effects for 93 genes and 112 loci. On the basis of the identified genetic effects, we characterize patterns of tissue specificity, compare local and distal effects, and evaluate the functional properties of the genetic effects. We also demonstrate that multi-tissue, multi-individual data can be used to identify genes and pathways affected by human disease-associated variation, enabling a mechanistic interpretation of gene regulation and the genetic basis of disease

    Ensembl core software resources : storage and programmatic access for DNA sequence and genome annotation

    No full text
    The Ensembl software resources are a stable infrastructure to store, access and manipulate genome assemblies and their functional annotations. The Ensembl 'Core' database and Application Programming Interface (API) was our first major piece of software infrastructure and remains at the centre of all of our genome resources. Since its initial design more than fifteen years ago, the number of publicly available genomic, transcriptomic and proteomic datasets has grown enormously, accelerated by continuous advances in DNA-sequencing technology. Initially intended to provide annotation for the reference human genome, we have extended our framework to support the genomes of all species as well as richer assembly models. Cross-referenced links to other informatics resources facilitate searching our database with a variety of popular identifiers such as UniProt and RefSeq. Our comprehensive and robust framework storing a large diversity of genome annotations in one location serves as a platform for other groups to generate and maintain their own tailored annotation. We welcome reuse and contributions: our databases and APIs are publicly available, all of our source code is released with a permissive Apache v2.0 licence at http://github.com/Ensembl and we have an active developer mailing list (http://www.ensembl.org/info/about/contact/index.html)

    Cost-effectiveness of weekly adaptive radiotherapy versus standard IMRT in head and neck cancer alongside the ARTIX trial

    No full text
    International audienceBACKGROUND AND PURPOSE: We performed a cost-effectiveness analysis (CEA) comparing an adaptive radiotherapy (ART) strategy, based on weekly replanning, aiming to correct the parotid gland overdose during treatment and expecting therefore to decrease xerostomia, when compared to a standard IMRT. MATERIALS AND METHODS: We conducted the ARTIX trial, a randomized, parallel-group, multicentric study comparing a systematic weekly replanning ART to a standard IMRT. The primary endpoint was the frequency of xerostomia at 12 months, measured by stimulating salivary flow with paraffin. The CEA was designed alongside the ARTIX trial which was linked to the French national health data system (SNDS). For each patient, healthcare consumptions and costs were provided by the SNDS. The reference case analysis was based on the primary endpoint of the trial. Sensitivity and scenario analyses were performed. RESULTS: Of the 129 patients randomly assigned between 2013 and 2018, only 2 records were not linked to the SNDS, which provides a linkage proportion of 98.4%. All of the other 127 records were linked with good to very good robustness. On the intent-to-treat population at 12 months, mean total costs per patient were €41,564 (SD 23,624) and €33,063 (SD 16,886) for ART and standard IMRT arms, respectively (p = 0.033). Incremental cost effectiveness ratio (ICER) was €162,444 per xerostomia avoided. At 24 months, ICER was €194,521 per xerostomia avoided. For both progression-free and overall survival, ART was dominated by standard IMRT. CONCLUSION: The ART strategy was deemed to be not cost-effective compared with standard IMRT for patients with locally advanced oropharyngeal cancer

    Randomized phase II trial in prostate cancer with hormone-sensitive OligometaSTatic relapse: Combining stereotactic ablative radiotherapy and durvalumab (POSTCARD GETUG P13): Study protocol

    No full text
    Background: As in other solid tumors, increasing evidence indicates that patients diagnosed with a limited number of prostate cancer metastases, so-called oligometastases, have a better prognosis than patients with extensive metastatic disease. Stereotactic body radiotherapy (SBRT) is now considered an option in this population.Programmed death-1 (PD-1) and its ligands (PD-L1) are targeted by immune checkpoint inhibitors. Preclinical studies have shown that the tumor immune microenvironment changes when combining radiotherapy with immunotherapy, especially with hypofractionated radiotherapy.The oligometastatic setting appears to be the most relevant clinical situation for evaluating the immune response generated by radiotherapy and immune checkpoint inhibitors in patients with an intact immune system.We hypothesize that durvalumab will enhance the immune response following SBRT targeting oligometastatic lesions. Our purpose is to demonstrate, via a randomized 2:1 phase II trial, that SBRT (3 fractions) with durvalumab in oligometastatic hormone-sensitive prostate cancer patients would improve progression-free survival in patients with prostate cancer with up to 5 metastases compared to patients who exclusively received SBRT. Methods: This is a multicentric randomized phase II study in French academic hospitals. Patients with prostate cancer and up to 5 metastases (lymph node and/or bone) were randomized into a 2:1 ratio between Arm A (experimental group), corresponding to durvalumab and SBRT to the metastases, and Arm B (control group), corresponding to SBRT alone to the metastases. The study aims to accrue a total of 96 patients within 3 years. The primary endpoint is two-year progression-free survival and secondary endpoints include androgen deprivation therapy-free survival, quality of life, toxicity, prostate cancer specific survival, overall survival, and immune response. Discussion: The expected benefit for the patients in the experimental arm is longer life expectancy with acceptable toxicity. We also expect our study to provide data for better understanding the synergy between immunotherapy and radiotherapy in oligometastatic prostate cancer
    corecore