Search CORE

2 research outputs found

APPRIS: selecting functionally important isoforms.

Author: Cerdán-Vélez Daniel
Di Domenico Tomás
Pozo Fernando
Rodriguez Jose Manuel
Tress Michael L
Vázquez Jesús
Publication venue: 'Oxford University Press (OUP)'
Publication date: 10/11/2021
Field of study

APPRIS (https://appris.bioinfo.cnio.es) is a well-established database housing annotations for protein isoforms for a range of species. APPRIS selects principal isoforms based on protein structure and function features and on cross-species conservation. Most coding genes produce a single main protein isoform and the principal isoforms chosen by the APPRIS database best represent this main cellular isoform. Human genetic data, experimental protein evidence and the distribution of clinical variants all support the relevance of APPRIS principal isoforms. APPRIS annotations and principal isoforms have now been expanded to 10 model organisms. In this paper we highlight the most recent updates to the database. APPRIS annotations have been generated for two new species, cow and chicken, the protein structural information has been augmented with reliable models from the EMBL-EBI AlphaFold database, and we have substantially expanded the confirmatory proteomics evidence available for the human genome. The most significant change in APPRIS has been the implementation of TRIFID functional isoform scores. TRIFID functional scores are assigned to all splice isoforms, and APPRIS uses the TRIFID functional scores and proteomics evidence to determine principal isoforms when core methods cannot.National Human Genome Research Institute of the National Institutes of Health [2 U41 HG007234]; Spanish Ministry of Science, Innovation and Universities [PGC2018-097019-B-I00]; Carlos III Institute of Health-Fondo de Investigacion Sanitaria [PRB3 ´ (IPT17/0019––ISCIII-SGEFI/ERDF, ProteoRed]; ‘la Caixa’ Banking Foundation [HR17-00247]. Funding for open access charge: National Human Genome Research Institute.S

PubMed Central

REPISALUD

GENCODE: reference annotation for the human and mouse genomes in 2023.

Author: Arnan Carme
Banerjee Abhimanyu
Barnes If
Bennett Ruth
Berry Andrew
Bignell Alexandra
Boix Carles
Calvet Ferriol
Carbonell-Sala Sílvia
Cerdán-Vélez Daniel
Choudhary Jyoti S
Cunningham Fiona
Davidson Claire
Diekhans Mark
Donaldson Sarah
Dursun Cagatay
Fatima Reham
Flicek Paul
Frankish Adam
Gerstein Mark
Giorgetti Stefano
Giron Carlos Garcıa
Gonzalez Jose Manuel
Guigo Roderic
Gómez Laura Martínez
Hardy Matthew
Harrison Peter W
Hollis Zoe
Hourlier Thibaut
Hubbard Tim J P
Hunt Toby
James Benjamin
Jiang Yunzhe
Johnson Rory
Jungreis Irwin
Kay Mike
Kellis Manolis
Kundaje Anshul
Lagarde Julien
Loveland Jane E
Martin Fergal J
Mudge Jonathan M
Nair Surag
Ni Pengyu
Paten Benedict
Pozo Fernando
Ramalingam Vivek
Ruffier Magali
Schmitt Bianca M
Schreiber Jacob M
Sisu Cristina
Steed Emily
Sumathipala Dulika
Suner Marie-Marthe
Sycheva Irina
Tress Michael L
Uszczynska-Ratajczak Barbara
Wass Elizabeth
Wright James C
Yang Yucheng T
Yates Andrew
Zafrulla Zahoor
Publication venue: 'Oxford University Press (OUP)'
Publication date: 24/11/2022
Field of study

GENCODE produces high quality gene and transcript annotation for the human and mouse genomes. All GENCODE annotation is supported by experimental data and serves as a reference for genome biology and clinical genomics. The GENCODE consortium generates targeted experimental data, develops bioinformatic tools and carries out analyses that, along with externally produced data and methods, support the identification and annotation of transcript structures and the determination of their function. Here, we present an update on the annotation of human and mouse genes, including developments in the tools, data, analyses and major collaborations which underpin this progress. For example, we report the creation of a set of non-canonical ORFs identified in GENCODE transcripts, the LRGASP collaboration to assess the use of long transcriptomic data to build transcript models, the progress in collaborations with RefSeq and UniProt to increase convergence in the annotation of human and mouse protein-coding genes, the propagation of GENCODE across the human pan-genome and the development of new tools to support annotation of regulatory features by GENCODE. Our annotation is accessible via Ensembl, the UCSC Genome Browser and https://www.gencodegenes.org

Bern Open Repository and Information System (BORIS)