Search CORE

5 research outputs found

Processing genome-wide association studies within a repository of heterogeneous genomic datasets

Author: Bernasconi Anna
Canakoglu Arif
Comolli Federico
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2023
Field of study

Background Genome Wide Association Studies (GWAS) are based on the observation of genome-wide sets of genetic variants – typically single-nucleotide polymorphisms (SNPs) – in different individuals that are associated with phenotypic traits. Research efforts have so far been directed to improving GWAS techniques rather than on making the results of GWAS interoperable with other genomic signals; this is currently hindered by the use of heterogeneous formats and uncoordinated experiment descriptions. Results To practically facilitate integrative use, we propose to include GWAS datasets within the META-BASE repository, exploiting an integration pipeline previously studied for other genomic datasets that includes several heterogeneous data types in the same format, queryable from the same systems. We represent GWAS SNPs and metadata by means of the Genomic Data Model and include metadata within a relational representation by extending the Genomic Conceptual Model with a dedicated view. To further reduce the gap with the descriptions of other signals in the repository of genomic datasets, we perform a semantic annotation of phenotypic traits. Our pipeline is demonstrated using two important data sources, initially organized according to different data models: the NHGRI-EBI GWAS Catalog and FinnGen (University of Helsinki). The integration effort finally allows us to use these datasets within multisample processing queries that respond to important biological questions. These are then made usable for multi-omic studies together with, e.g., somatic and reference mutation data, genomic annotations, epigenetic signals. Conclusions As a result of our work on GWAS datasets, we enable 1) their interoperable use with several other homogenized and processed genomic datasets in the context of the META-BASE repository; 2) their big data processing by means of the GenoMetric Query Language and associated system. Future large-scale tertiary data analysis may extensively benefit from the addition of GWAS results to inform several different downstream analysis workflows

Archivio istituzionale della ricerca - Politecnico di Milano

Where to search top-K biomedical ontologies?

Author: Butt Anila Sahar
Haller Armin
Oliveira Daniela
Rebholz-Schuhmann Dietrich
Sahay Ratnesh
Publication venue: 'Oxford University Press (OUP)'
Publication date: 06/07/2018
Field of study

Motivation Searching for precise terms and terminological definitions in the biomedical data space is problematic, as researchers find overlapping, closely related and even equivalent concepts in a single or multiple ontologies. Search engines that retrieve ontological resources often suggest an extensive list of search results for a given input term, which leads to the tedious task of selecting the best-fit ontological resource (class or property) for the input term and reduces user confidence in the retrieval engines. A systematic evaluation of these search engines is necessary to understand their strengths and weaknesses in different search requirements. Result We have implemented seven comparable Information Retrieval ranking algorithms to search through ontologies and compared them against four search engines for ontologies. Free-text queries have been performed, the outcomes have been judged by experts and the ranking algorithms and search engines have been evaluated against the expert-based ground truth (GT). In addition, we propose a probabilistic GT that is developed automatically to provide deeper insights and confidence to the expert-based GT as well as evaluating a broader range of search queries. Conclusion The main outcome of this work is the identification of key search factors for biomedical ontologies together with search requirements and a set of recommendations that will help biomedical experts and ontology engineers to select the best-suited retrieval mechanism in their search scenarios. We expect that this evaluation will allow researchers and practitioners to apply the current search techniques more reliably and that it will help them to select the right solution for their daily work. Availability The source code (of seven ranking algorithms), ground truths and experimental results are available at https://github.com/danielapoliveira/bioont-search-benchmarkThis work has been supported by the Science Foundation Ireland (grant number SFI/12/RC/2289).peer-reviewed2019-03-2

Irish Universities

Access to Research at National University of Ireland, Galway

Where to search top-K biomedical ontologies?

Author: Butt Anila Sahar
Haller Armin
Oliveira Daniela
Rebholz-Schuhmann D
Sahay R
Publication venue: 'Oxford University Press (OUP)'
Publication date: 06/07/2020
Field of study

Motivation: Searching for precise terms and terminological definitions in the biomedical data space is problematic, as researchers find overlapping, closely related and even equivalent concepts in a single or multiple ontologies. Search engines that retrieve ontological resources often suggest an extensive list of search results for a given input term, which leads to the tedious task of selecting the best-fit ontological resource (class or property) for the input term and reduces user confidence in the retrieval engines. A systematic evaluation of these search engines is necessary to understand their strengths and weaknesses in different search requirements.This work has been supported by the Science Foundation Ireland (grant number SFI/12/RC/2289)

The Australian National University

Where to search top-K biomedical ontologies?

Author: Adamusiak
Alani
Anila Sahar Butt
Armin Haller
Ashburner
Avraham
Bard
Bodenreider
Buitelaar
Buitelaar
Butt
Courtot
Daniela Oliveira
Dietrich Rebholz-Schuhmann
Ding
d’Aquin
Gangemi
Gavankar
Gkoutos
Guarino
Haendel
Hanna
Harth
Hastings
Hayamizu
He
He
Hoehndorf
Horridge
Hu
Ison
Jupp
Kim
Kurbatova
Köhler
Lamiroy
Lozano-Tello
MacArthur
Malone
Marshall
Mungall
Page
Page
Park
Patel
Petryszak
Qu
Ratnesh Sahay
Robertson
Rosse
Salton
Salton
Schindelman
Schofield
Schriml
Segerdell
Sioutos
Smith
Smith
Suchanek
Szolovits
Thomas
Tummarello
Van Slyke
Wang
Whetzel
Zaragoza
Publication venue: 'Oxford University Press (OUP)'
Publication date
Field of study

Crossref