Search CORE

10 research outputs found

Application-based fault tolerance techniques for sparse matrix solvers

Author: Hunt Rob
McIntosh–Smith Simon
Price James
Vesztrocy Alex Warwick
Publication venue: 'SAGE Publications'
Publication date: 10/05/2017
Field of study

High-performance computing systems continue to increase in size in the quest for ever higher performance. The resulting increased electronic component count, coupled with the decrease in feature sizes of the silicon manufacturing processes used to build these components, may result in future exascale systems being more susceptible to soft errors caused by cosmic radiation than in current high-performance computing systems. Through the use of techniques such as hardware-based error-correcting codes and checkpoint-restart, many of these faults can be mitigated at the cost of increased hardware overhead, run-time, and energy consumption that can be as much as 10–20%. Some predictions expect these overheads to continue to grow over time. For extreme scale systems, these overheads will represent megawatts of power consumption and millions of dollars of additional hardware costs, which could potentially be avoided with more sophisticated fault-tolerance techniques. In this paper we present new software-based fault tolerance techniques that can be applied to one of the most important classes of software in high-performance computing: iterative sparse matrix solvers. Our new techniques enables us to exploit knowledge of the structure of sparse matrices in such a way as to improve the performance, energy efficiency, and fault tolerance of the overall solution. </jats:p

Crossref

Explore Bristol Research

The OMA orthology database in 2018: retrieving evolutionary relationships among all domains of life through richer web and programmatic interfaces.

Author: Adrian M Altenhoff
Alex Warwick Vesztrocy
Charles Stevenson
Christophe Dessimoz
Clément-Marie Train
David Dylus
Gaston H Gonnet
Henning Redestig
Jiao Long
Karina Zile
Klara Kaleb
Natasha M Glover
Tarcisio M de Farias
The UniProt Consortium
Publication venue: 'Oxford University Press (OUP)'
Publication date: 27/10/2017
Field of study

The Orthologous Matrix (OMA) is a leading resource to relate genes across many species from all of life. In this update paper, we review the recent algorithmic improvements in the OMA pipeline, describe increases in species coverage (particularly in plants and early-branching eukaryotes) and introduce several new features in the OMA web browser. Notable improvements include: (i) a scalable, interactive viewer for hierarchical orthologous groups; (ii) protein domain annotations and domain-based links between orthologous groups; (iii) functionality to retrieve phylogenetic marker genes for a subset of species of interest; (iv) a new synteny dot plot viewer; and (v) an overhaul of the programmatic access (REST API and semantic web), which will facilitate incorporation of OMA analyses in computational pipelines and integration with other bioinformatic resources. OMA can be freely accessed at https://omabrowser.org

Repository for Publications and Research Data

Crossref

Serveur académique lausannois

UCL Discovery

The CAFA challenge reports improved protein function prediction and new functional annotations for hundreds of genes through experimental screens

Author: Aashish Jain
Adrian Altenhoff
Ahmet S. Rifaioglu
Alan J. Medlar
Alberto Paccanaro
Alessandro Petrini
Alex A. Freitas
Alex W. Crocker
Alex Warwick Vesztrocy
Alexandra J. Lee
Alexandre Renaux
Alfonso E. Romero
Alfredo Benso
Alice C. McHardy
Alperen Dalkıran
Angela Wilkins
Asa Ben-Hur
Ashton R. Omdahl
Balint Z. Kacsoh
Branislava Gemovic
Burkhard Rost
Caleb Chandler
Casey S. Greene
Castrense Savojardo
Cen Wan
Chenguang Zhao
Chengxin Zhang
Christine A. Orengo
Christophe Dessimoz
Claire O’Donovan
Constance J. Jeffery
Da Chen Emily Koo
Daisuke Kihara
Dallas J. Larsen
Damiano Piovesan
Dane Jo
Daniel B. Roche
Danielle A. Brackenridge
David T. Jones
David W. Ritchie
Deborah A. Hogan
Devon Johnson
Domenico Cozzetto
Ehsaneddin Asgari
Elaine Zosa
Enrico Lavezzo
Erica Suh
Fabio Fabris
Farrokh Mehryary
Feng Zhang
Filip Ginter
Florian Boecker
Fran Supek
Gage S. Black
George Georghiou
Gianfranco Politano
Giorgio Valentini
Giovanni Bosco
Giuliano Grossi
Giuseppe Profiti
Hafeez Ur Rehman
Hai Fang
Haixuan Yang
Hans Moen
Heiko Schoof
Huy N. Nguyen
Ian Sillitoe
Iddo Friedberg
Ilya Novikov
Imane Boudellioua
Indika Kahanda
Itamar Borukhov
Jari Björne
Jeffrey M. Yunes
Jia-Ming Chang
Jianlin Cheng
Jie Hou
Jonas Reeb
Jonathan B. Dayton
Jonathan Gill Lees
Jose Manuel Rodriguez
José M. Fernández
Julian Gough
Kai Hakala
Kimberley A. Lewis
Larry Davis
Liam J. McGuffin
Liisa Holm
Magdalena Antczak
Marco Carraro
Marco Falda
Marco Frasca
Marco Mesiti
Marco Notaro
Maria J. Martin
Marie-Dominique Devignes
Mark N. Wass
Martti E.E. Tolvanen
Mateo Torres
Matteo Re
Maxat Kulmanov
Md Nafiz Hamid
Meet Barot
Michael L. Tress
Michal Linial
Michele Berselli
Miguel Amezola
Mohammad R.K. Mofrad
Naihui Zhou
Natalie Thurlby
Neven Sumonja
Nevena Veljkovic
Olivier Lichtarge
Paolo Fontana
Patricia C. Babbitt
Peter L. Freddolino
Peter W. Rose
Petri Törönen
Pier Luigi Martelli
Po-Han Chi
Prajwal Bhat
Predrag Radivojac
Qizhong Mao
Rabie Saidi
Radoslav S. Davidović
Rebecca L. Hurto
Rengul Cetin Atalay
Renzhi Cao
Richard Bonneau
Rita Casadio
Robert Hoehndorf
Ronghui You
Rui Fa
Sabeur Aridhi
Saso Dzeroski
Sayoni Das
Sean D. Mooney
Seyed Ziaeddin Alborzi
Shanfeng Zhu
Shanshan Zhang
Shuwei Yao
Silvio C.E. Tosatto
Slobodan Vucetic
Stefano Di Carlo
Stefano Pascarelli
Stefano Toppo
Steven E. Brenner
Suwisa Kaewphan
Suyang Dai
Tapio Salakoski
Tatyana Goldberg
Timothy R. Bergquist
Tomislav Šmuc
Tunca Dogan
Vedrana Vidulin
Vladimir Gligorijević
Vladimir R. Perovic
Volkan Atalay
Wei-Cheng Tseng
Weidong Tian
Wen-Hung Liao
Yang Zhang
Yi-Wei Liu
Yotam Frank
Yuxiang Jiang
Zheng Wang
Zihan Zhang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 27/10/2022
Field of study

BackgroundThe Critical Assessment of Functional Annotation (CAFA) is an ongoing, global, community-driven effort to evaluate and improve the computational annotation of protein function.ResultsHere, we report on the results of the third CAFA challenge, CAFA3, that featured an expanded analysis over the previous CAFA rounds, both in terms of volume of data analyzed and the types of analysis performed. In a novel and major new development, computational predictions and assessment goals drove some of the experimental assays, resulting in new functional annotations for more than 1000 genes. Specifically, we performed experimental whole-genome mutation screening in Candida albicans and Pseudomonas aureginosa genomes, which provided us with genome-wide experimental data for genes associated with biofilm formation and motility. We further performed targeted assays on selected genes in Drosophila melanogaster, which we suspected of being involved in long-term memory.ConclusionWe conclude that while predictions of the molecular function and biological process annotations have slightly improved over time, those of the cellular component have not. Term-centric prediction of experimental annotations remains equally challenging; although the performance of the top methods is significantly better than the expectations set by baseline methods in C. albicans and D. melanogaster, it leaves considerable room and need for improvement. Finally, we report that the CAFA community now involves a broad range of participants with expertise in bioinformatics, biological experimentation, biocuration, and bio-ontologies, working together to improve functional annotation, computational function prediction, and our ability to manage big data in the era of large experimental screens.</p

UTUPub

Prioritising candidate genes causing QTL using hierarchical orthologous groups

Author: Dessimoz Christophe
Redestig Henning
Warwick Vesztrocy Alex
Publication venue
Publication date: 05/10/2021
Field of study

Abstract Motivation A key goal in plant biotechnology applications is the identification of genes associated to particular phenotypic traits (for example: yield, fruit size, root length). Quantitative Trait Loci (QTL) studies identify genomic regions associated with a trait of interest. However, to infer potential causal genes in these regions, each of which can contain hundreds of genes, these data are usually intersected with prior functional knowledge of the genes. This process is however laborious, particularly if the experiment is performed in a non-model species, and the statistical significance of the inferred candidates is typically unknown. Results This paper introduces QTLSearch, a method and software tool to search for candidate causal genes in QTL studies by combining Gene Ontology annotations across many species, leveraging hierarchical orthologous groups. The usefulness of this approach is demonstrated by re-analysing two metabolic QTL studies: one in Arabidopsis thaliana, the other in Oryza sativa subsp. indica. Even after controlling for statistical significance, QTLSearch inferred potential causal genes for more QTL than BLAST-based functional propagation against UniProtKB/Swiss-Prot, and for more QTL than in the original studies. Availability and implementation QTLSearch is distributed under the LGPLv3 license. It is available to install from the Python Package Index (as qtlsearch), with the source available from https://bitbucket.org/alex-warwickvesztrocy/qtlsearch. Supplementary information Supplementary data are available at Bioinformatics online

RERO DOC Digital Library

Multifaceted quality assessment of gene repertoire annotation with OMArk

Author: Adrian Altenhoff
Christophe Dessimoz
Clément Train
Natasha Glover
Victor Rossier
Warwick Vesztrocy Alex
Yannis Nevers
Publication venue: Zenodo
Publication date: 23/10/2023
Field of study

Dataset associated to the OMArk paper.Contain eight archives:Supplementary_TablesThe Supplementary Table files referred to in the paperOMAmerDB:The OMAmer database constructed using the whole dataset of the OMA database (November 2022 Release) and used in the paper. An OMAmer database is necessary to run OMArk.Simulation: Proteomes with artificially introduced errors, contaminants or depleted completeness, used to assess OMArk's performance. The archive contains the generated proteomes (Simulated_Data) and their OMArk quality assessments (omark). They also contains the OMAmer results (OMAmerResults) that were used to run OMArk and BUSCO completeness assessments (BUSCO).*Note that for storage efficiency, only the non-redundant part of the data (added errors, added contamination, random fraction of proteomes) are stored there. The full modified proteome can be regenerated from these data and the source proteomes.Reference Proteomes:The UniProt Reference Proteomes (Proteomes) (2021_04) and their proteome quality assesment results according to OMArk. The archive contains the source proteome FASTA (Source folder),  OMAmer results for these proteomes (omamer folder) , OMArk results (omark folder), and BUSCO completeness assesments (BUSCO folder). It also contains a subfolder that contains part of the Contamination detection experiment (Contamination folder).Ensembl_Metazoa_AssemblyChange. Contains Ensembl Metazoa proteomes with version change between version 52 and 54 as well as their quality assesment resuls for both version. The archive contains the source proteomes FASTA (Source folder), a Splice file that group together all proteins coded by the same gene (Splice folder), omamer results for the proteomes (omamer folder) and the omark results (omark folder)MissingGenesBLAST Contains sequences of HOGs considered as missing in the Human proteome, that was used to look for sequences in the human genome.Ensembl_NCBI_ResultsContains OMArk and BUSCO results for Ensembl and NCBI proteomes. These results were then used to evaluate OMArk biais due to source of proteomes in the OMA database.Notebooks Jupyter Notebooks that were used to perform the analysis described in the paper  </p&gt

ZENODO

Application-based fault tolerance techniques for sparse matrix solvers

Author: Alex Warwick Vesztrocy
Bergman K
Fang YP
James Price
Rob Hunt
Simon McIntosh–Smith
Warren H
Publication venue: 'SAGE Publications'
Publication date
Field of study

Crossref

Quality assessment of gene repertoire annotations with OMArk

Author: Altenhoff Adrian Michael
Dessimoz Christophe
Glover Natasha M.
Nevers Yannis
Rossier Victor
Train Clément-Marie
Warwick Vesztrocy Alex
Publication venue: Nature
Publication date: 01/01/2024
Field of study

In the era of biodiversity genomics, it is crucial to ensure that annotations of protein-coding gene repertoires are accurate. State-of-the-art tools to assess genome annotations measure the completeness of a gene repertoire but are blind to other errors, such as gene overprediction or contamination. We introduce OMArk, a software package that relies on fast, alignment-free sequence comparisons between a query proteome and precomputed gene families across the tree of life. OMArk assesses not only the completeness but also the consistency of the gene repertoire as a whole relative to closely related species and reports likely contamination events. Analysis of 1,805 UniProt Eukaryotic Reference Proteomes with OMArk demonstrated strong evidence of contamination in 73 proteomes and identified error propagation in avian gene annotation resulting from the use of a fragmented zebra finch proteome as a reference. This study illustrates the importance of comparing and prioritizing proteomes based on their quality measures.ISSN:1546-1696ISSN:1087-015

Repository for Publications and Research Data

The OMA orthology database in 2018: retrieving evolutionary relationships among all domains of life through richer web and programmatic interfaces

Author: Adrian M Altenhoff
Alex Warwick Vesztrocy
Charles Stevenson
Christophe Dessimoz
Clément-Marie Train
David Dylus
Gaston H Gonnet
Henning Redestig
Jiao Long
Karina Zile
Klara Kaleb
Natasha M Glover
Tarcisio M de Farias
The UniProt Consortium
Publication venue: 'Oxford University Press (OUP)'
Publication date
Field of study

Crossref

Recommended from our members

The CAFA challenge reports improved protein function prediction and new functional annotations for hundreds of genes through experimental screens.

Author: Alborzi Seyed Ziaeddin
Antczak Magdalena
Aridhi Sabeur
Asgari Ehsaneddin
Atalay Volkan
Barot Meet
Bergquist Timothy R
Bhat Prajwal
Boecker Florian
Bonneau Richard
Borukhov Itamar
Casadio Rita
Cetin Atalay Rengul
Cheng Jianlin
Chi Po-Han
Cozzetto Domenico
Crocker Alex W
Dalkıran Alperen
Das Sayoni
Davidović Radoslav S
Davis Larry
Dessimoz Christophe
Devignes Marie-Dominique
Dogan Tunca
Dzeroski Saso
Fa Rui
Fabris Fabio
Fang Hai
Fernández José M
Frasca Marco
Freddolino Peter L
Freitas Alex A
Gemovic Branislava
Georghiou George
Gligorijević Vladimir
Goldberg Tatyana
Gough Julian
Grossi Giuliano
Hamid Md Nafiz
Holm Liisa
Hou Jie
Hurto Rebecca L
Jiang Yuxiang
Jones David T
Kacsoh Balint Z
Kahanda Indika
Koo Da Chen Emily
Lavezzo Enrico
Lee Alexandra J
Lees Jonathan Gill
Lewis Kimberley A
Lichtarge Olivier
Linial Michal
Martelli Pier Luigi
McHardy Alice C
Medlar Alan J
Mesiti Marco
Mofrad Mohammad RK
Nguyen Huy N
Notaro Marco
Novikov Ilya
Paccanaro Alberto
Perovic Vladimir R
Petrini Alessandro
Profiti Giuseppe
Re Matteo
Reeb Jonas
Renaux Alexandre
Rifaioglu Ahmet S
Ritchie David W
Roche Daniel B
Rodriguez Jose Manuel
Romero Alfonso E
Rose Peter W
Saidi Rabie
Savojardo Castrense
Schoof Heiko
Sillitoe Ian
Sumonja Neven
Supek Fran
Thurlby Natalie
Toppo Stefano
Torres Mateo
Tress Michael L
Tseng Wei-Cheng
Törönen Petri
Valentini Giorgio
Veljkovic Nevena
Vidulin Vedrana
Wan Cen
Wang Zheng
Warwick Vesztrocy Alex
Wass Mark N
Wilkins Angela
Yang Haixuan
Zhang Chengxin
Zhang Yang
Zhao Chenguang
Zhou Naihui
Zosa Elaine
Publication venue: eScholarship, University of California
Publication date: 01/11/2019
Field of study

eScholarship - University of California