7 research outputs found

Taxonomy Builder: A Data-driven and User-centric Tool for Streamlining Taxonomy Construction

Author: Alcock K.
Andrews W.
Bethard S.
Chan Y.S.
Gyori B.M.
Hilverman C.
Hungerford J.
Laparra E.
MacBride J.
Min B.
Qiu H.
Reynolds M.
Sharp R.
Surdeanu M.
Tang Z.
Thomas M.
Zhang Z.
Zupon A.
Zverev Y.
Publication venue: 'Association for Computational Linguistics (ACL)'
Publication date: 01/01/2022

An existing domain taxonomy for normalizing content is often assumed when discussing approaches to information extraction, yet often in real-world scenarios there is none. When one does exist, as the information needs shift, it must be continually extended. This is a slow and tedious task, and one that does not scale well. Here we propose an interactive tool that allows a taxonomy to be built or extended rapidly and with a human in the loop to control precision. We apply insights from text summarization and information extraction to reduce the search space dramatically, then leverage modern pretrained language models to perform contextualized clustering of the remaining concepts to yield candidate nodes for the user to review. We show this allows a user to consider as many as 200 taxonomy concept candidates an hour to quickly build or extend a taxonomy to better fit information needs. © 2022 Association for Computational Linguistics. Open access journal. This item from the UA Faculty Publications collection is made available by the University of Arizona with support from the University of Arizona Libraries.

The University of Arizona

Unifying the identification of biomedical entities with the Bioregistry

Author: Balk M.
Callahan T.J.
Domingo-Fernández D.
Gyori B.M.
Haendel M.A.
Hegde H.B.
Himmelstein D.S.
Hoyt C.T.
Karis K.
Kunze J.
Lubiana T.
Matentzoglu N.
McMurry J.
Moxon S.
Mungall C.J.
Rutz A.
Unni D.R.
Willighagen E.
Winston D.
Publication venue: Nature Research
Publication date: 01/01/2022

The standardized identification of biomedical entities is a cornerstone of interoperability, reuse, and data integration in the life sciences. Several registries have been developed to catalog resources maintaining identifiers for biomedical entities such as small molecules, proteins, cell lines, and clinical trials. However, existing registries have struggled to provide sufficient coverage and metadata standards that meet the evolving needs of modern life sciences researchers. Here, we introduce the Bioregistry, an integrative, open, community-driven metaregistry that synthesizes and substantially expands upon 23 existing registries. The Bioregistry addresses the need for a sustainable registry by leveraging public infrastructure and automation, and employing a progressive governance model centered around open code and open data to foster community contribution. The Bioregistry can be used to support the standardized annotation of data, models, ontologies, and scientific literature, thereby promoting their interoperability and reuse. The Bioregistry can be accessed through https://bioregistry.io and its source code and data are available under the MIT and CC0 Licenses at https://github.com/biopragmatics/bioregistry. © 2022, The Author(s). Open access journal.
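As a rough illustration of what standardized identification can look like in practice, the sketch below splits a compact identifier of the form prefix:identifier and builds a link under the https://bioregistry.io address given in the abstract. The function names and the sample identifier are hypothetical, and the assumption that the site accepts a path of the form /prefix:identifier is not stated in the abstract.

```python
def parse_compact_id(compact_id):
    """Split 'prefix:identifier' at the first colon (hypothetical helper)."""
    prefix, sep, identifier = compact_id.partition(":")
    if not sep or not prefix or not identifier:
        raise ValueError(f"not a compact identifier: {compact_id!r}")
    return prefix, identifier


def resolver_url(compact_id, base="https://bioregistry.io"):
    """Build a link for a compact identifier.

    Assumption: the resolver accepts '<base>/<prefix>:<identifier>'.
    """
    prefix, identifier = parse_compact_id(compact_id)
    return f"{base}/{prefix}:{identifier}"


if __name__ == "__main__":
    # Sample identifier chosen only for illustration.
    print(resolver_url("chebi:138488"))
```

Splitting at the first colon keeps identifiers that themselves contain a colon intact in the second part.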

Maastricht University Research Portal

PubMed Central

The University of Arizona

eScholarship - University of California

NEUROSURGERY ENTHUSIASTIC WOMEN SOCIETY

Unifying the identification of biomedical entities with the Bioregistry

Author: Balk M.
Callahan T.J.
Domingo-Fernández D.
Gyori B.M.
Haendel M.A.
Hegde H.B.
Himmelstein D.S.
Hoyt C.T.
Karis K.
Kunze J.
Lubiana T.
Matentzoglu N.
McMurry J.
Moxon S.
Mungall C.J.
Rutz A.
Unni D.R.
Willighagen E.
Winston D.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2022

The University of Arizona

A Simple Standard for Sharing Ontological Mappings (SSSOM)

Author: Balhoff J.P.
Bello S.M.
Bizon C.
Brush M.
Callahan T.J.
Chute C.G.
Duncan W.D.
Evelo C.T.
Gabriel D.
Gray A.
Graybeal J.
Gyori B.M.
Haendel M.
Harmse H.
Harris N.L.
Harrow I.
Hegde H.B.
Hoyt A.L.
Hoyt C.T.
Jiao D.Z.
Jimenez-Ruiz E.
Jupp S.
Kim H.
Koehler S.
Liener T.
Long Q.Q.
Malone J.
Matentzoglu N.
McLaughlin J.A.
McMurry J.A.
Moxon S.
Mungall C.J.
Munoz-Torres M.C.
Osumi-Sutherland D.
Overton J.A.
Peters B.
Putman T.
Queralt Rosinach N.
Shefchek K.
Solbrig H.
Thessen A.
Tudorache T.
Vasilevsky N.
Wagner A.H.
Publication venue: 'Oxford University Press (OUP)'
Publication date: 25/05/2022

Despite progress in the development of standards for describing and exchanging scientific information, the lack of easy-to-use standards for mapping between different representations of the same or similar objects in different databases poses a major impediment to data integration and interoperability. Mappings often lack the metadata needed to be correctly interpreted and applied. For example, are two terms equivalent or merely related? Are they narrow or broad matches? Or are they associated in some other way? Such relationships between the mapped terms are often not documented, which leads to incorrect assumptions and makes them hard to use in scenarios that require a high degree of precision (such as diagnostics or risk prediction). Furthermore, the lack of descriptions of how mappings were done makes it hard to combine and reconcile mappings, particularly curated and automated ones. We have developed the Simple Standard for Sharing Ontological Mappings (SSSOM) which addresses these problems by: (i) Introducing a machine-readable and extensible vocabulary to describe metadata that makes imprecision, inaccuracy and incompleteness in mappings explicit. (ii) Defining an easy-to-use simple table-based format that can be integrated into existing data science pipelines without the need to parse or query ontologies, and that integrates seamlessly with Linked Data principles. (iii) Implementing open and community-driven collaborative workflows that are designed to evolve the standard continuously to address changing requirements and mapping practices. (iv) Providing reference tools and software libraries for working with the standard. In this paper, we present the SSSOM standard, describe several use cases in detail and survey some of the existing work on standardizing the exchange of mappings, with the goal of making mappings Findable, Accessible, Interoperable and Reusable (FAIR). The SSSOM specification can be found at http://w3id.org/sssom/spec
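The table-based format mentioned in point (ii) can be read with ordinary tabular tooling. The following is a minimal sketch, assuming a tab-separated file with the columns subject_id, predicate_id, object_id and mapping_justification and optional metadata lines starting with "#". The sample rows and the helper names are made up for illustration and do not come from the paper.

```python
import csv

# Illustrative sample only; identifiers are invented for this sketch.
SAMPLE_TSV = (
    "# mapping_set_id: https://example.org/mappings/demo\n"
    "subject_id\tpredicate_id\tobject_id\tmapping_justification\n"
    "EX:0001\tskos:exactMatch\tOTHER:1001\tsemapv:ManualMappingCuration\n"
    "EX:0002\tskos:broadMatch\tOTHER:1002\tsemapv:LexicalMatching\n"
    "EX:0003\tskos:exactMatch\tOTHER:1003\tsemapv:LexicalMatching\n"
)


def read_mappings(text):
    """Parse mapping rows into dicts, skipping blank and '#' metadata lines."""
    lines = [ln for ln in text.splitlines() if ln and not ln.startswith("#")]
    return list(csv.DictReader(lines, delimiter="\t"))


def exact_matches(rows):
    """Keep only pairs whose relationship is documented as an exact match."""
    return [
        (row["subject_id"], row["object_id"])
        for row in rows
        if row["predicate_id"] == "skos:exactMatch"
    ]


if __name__ == "__main__":
    for subject, obj in exact_matches(read_mappings(SAMPLE_TSV)):
        print(subject, "->", obj)
```

Because the relationship type is an explicit column, a pipeline that needs high precision can filter on it directly instead of assuming every mapping is an equivalence.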

Leiden University Scholarly Publications

Data Integration of Hybrid Microarray and Single Cell Expression Data to Enhance Gene Network Inference

Author: Amaral M.E.A.
Banf M.
Barzel B.
Bitencourt-Ferreira G.
Bower J.M.
Castillo D.
Chan T.E.
Chen S.
Curtis C.
de Ávila M.B.
Edgar R.
Feizi S.
Ghanat Bari M.
Gyori B.M.
Huynh-Thu V.A.
Hwang D.
Imam S.
Jianming Zhang
Kholodenko B.
Lam K.Y.
Levin N.M.B.
Lim N.
Lin D.
Liu L.Z.
Ma T.
Marbach D.
Matsumoto H.
Metzger-Filho O.
Nascimento M.
Ning Wang
Nookaew I.
Ocone A.
Park S.
Petralia F.
Rodrigo G.
Schaffter T.
Sławek J.
Tabe-Bordbar S.
Tibshirani R.
Wei Zhang
Wenchao Li
Xavier M.M.
Zarayeneh N.
Publication venue: 'Bentham Science Publishers Ltd.'

Crossref

Clinical Trial Data Management Software: A Review of the Technical Features

Author: Amorim R.C.
Arab L.
Aynaz Nourani
Barr R.
Bebis G.
Bryan H.E.
Buchsbaum R.
Buxmann P.
Cavenaugh J.S.
Cramon P.
Das S.
Dezeuze A.
Durkalski V.
Fraccaro P.
Friedman L.M.
Fu L.
Gao Q-B.
Gazali Kaur.S.
Gorrell L.M.
Gyori B.M.
Haleh Ayatollahi
Hou Z.
James K.L.
Krishnankutty B.
Kruse R.L.
Lee H.
Li Z.
Lin S.
Lu Z.
Masoud Solaymani Dodaran
McGraw M.J.
Micard E.
Mouratidou M.
Musick B.S.
Müller J.
Nash F.
Ngari M.M.
Nourani A.
Ohmann C.
Oluwatosin H.S.
Park J.Y.
Payne P.R.
Popp K.
Pozamantir A.
Prokscha S.
Ratib O.
Rorie D.A.
St Germain D.C.
Stenzhorn H.
Tran V-A.
Vojtáš P.
Wang X.
Weiler G.
Wen W.
Wilson A.S.
Wu Y.
Zasada S.J.
Publication venue: 'Bentham Science Publishers Ltd.'

Crossref

COVID19 Disease Map, a computational knowledge repository of virus-host interaction mechanisms.

Author: Acencio M.L.
Ackerman E.E.
Aghamiri S.S.
Auffray C.
Augé F.
Babur O.
Bachman J.A.
Ballereau S.
Balling R.
Barillot E.
Bauch A.
Beckmann J.S.
Bocskei Z.
Borlinghaus H.
Brauner B.
Börnigen D.
Calzone L.
Conti M.
Coort S.
Czauderna T.
D'Eustachio P.
De Meulder B.
de Waard A.
Demir E.
Dopazo J.
Dräger A.
Dugourd A.
Ehrhart F.
Eijssen L.
Esteban-Medina M.
Evelo C.T.
Fergusson L.
Fobo G.
Fraser R.
Freeman T.C.
Frishman G.
Funahashi A.
Gawron P.
Gillespie M.E.
Glaab E.
Goble C.
Golebiewski M.
Grouès V.
Gyori B.M.
Hanspers K.
Hasenauer J.
Haw R.
Heirendt L.
Helikar T.
Hermjakob H.
Hiki Y.
Hiroi N.
Hoch M.
Hu X.
Iannuccelli M.
Jassal B.
Kitano H.
Kocakaya E.
Korcsmaros T.
Kumar Gupta S.
Kuperstein I.
Kutmon M.
Licata L.
Luna A.
Maier D.
Marchesi S.
Martens M.
Matthews L.
Mazein A.
Messina F.
Modos D.
Monraz Gómez L.C.
Montagud A.
Montrone C.
Nakonecnij V.
Naldi A.
Naveez M.
Nesterova A.
Niarakis A.
Noël V.
Olbei M.
Orlic-Milacic M.
Orta-Resendiz A.
Ortseifen V.
Ostaszewski M.
Overall R.W.
Owen S.
Oxford K.
Peña-Chilet M.
Phair R.
Pham N.
Pico A.R.
Ponce de Leon M.
Porras P.
Puniya B.L.
Rameil M.
Ravel J.M.
Renz A.
Rex D.A.B.
Rian K.
Riutta A.
Rothfels K.
Ruepp A.
Sacco F.
Saez-Rodriguez J.
Sander C.
Satagopam V.
Scheel J.
Schmiester L.
Schneider R.
Schreiber F.
Senff Ribeiro A.
Sevilla C.
Shamovsky V.
Shoemaker J.E.
Singh V.
Slenter D.
Smula E.
Soliman S.
Somers J.
Stein L.D.
Stephan R.
Summak G.Y.
Teuton J.
Treveil A.
Turei D.
Valdeolivas A.
Valencia A.
Vanhoefer J.
Varusai T.
Vazquez M.
Vega C.
Wang M.
Willighagen E.L.
Wolkenhauer O.
Wu G.
Yamada T.G.
Yuryev A.
Zucker J.
Publication venue: 'EMBO'
Publication date: 01/10/2021

We need to effectively combine the knowledge from surging literature with complex datasets to propose mechanistic models of SARS-CoV-2 infection, improving data interpretation and predicting key targets of intervention. Here, we describe a large-scale community effort to build an open access, interoperable and computable repository of COVID-19 molecular mechanisms. The COVID-19 Disease Map (C19DMap) is a graphical, interactive representation of disease-relevant molecular mechanisms linking many knowledge sources. Notably, it is a computational resource for graph-based analyses and disease modelling. To this end, we established a framework of tools, platforms and guidelines necessary for a multifaceted community of biocurators, domain experts, bioinformaticians and computational biologists. The diagrams of the C19DMap, curated from the literature, are integrated with relevant interaction and text mining databases. We demonstrate the application of network analysis and modelling approaches by concrete examples to highlight new testable hypotheses. This framework helps to find signatures of SARS-CoV-2 predisposition, treatment response or prioritisation of drug candidates. Such an approach may help deal with new waves of COVID-19 or similar pandemics in the long-term perspective.

Serveur académique lausannois