Search CORE

19,925 research outputs found

Knowledge-based Biomedical Data Science 2019

Author: Callahan Tiffany J.
Hunter Lawrence E.
Pielke-Lombardo Harrison
Tripodi Ignacio J.
Publication venue
Publication date: 08/10/2019
Field of study

Knowledge-based biomedical data science (KBDS) involves the design and implementation of computer systems that act as if they knew about biomedicine. Such systems depend on formally represented knowledge in computer systems, often in the form of knowledge graphs. Here we survey the progress in the last year in systems that use formally represented knowledge to address data science problems in both clinical and biological domains, as well as on approaches for creating knowledge graphs. Major themes include the relationships between knowledge graphs and machine learning, the use of natural language processing, and the expansion of knowledge-based approaches to novel domains, such as Chinese Traditional Medicine and biodiversity.Comment: Manuscript 43 pages with 3 tables; Supplemental material 43 pages with 3 table

arXiv.org e-Print Archive

Using the Literature to Identify Confounders

Author: Malec Scott
Publication venue: DigitalCommons@TMC
Publication date: 01/01/2018
Field of study

Prior work in causal modeling has focused primarily on learning graph structures and parameters to model data generating processes from observational or experimental data, while the focus of the literature-based discovery paradigm was to identify novel therapeutic hypotheses in publicly available knowledge. The critical contribution of this dissertation is to refashion the literature-based discovery paradigm as a means to populate causal models with relevant covariates to abet causal inference. In particular, this dissertation describes a generalizable framework for mapping from causal propositions in the literature to subgraphs populated by instantiated variables that reflect observational data. The observational data are those derived from electronic health records. The purpose of causal inference is to detect adverse drug event signals. The Principle of the Common Cause is exploited as a heuristic for a defeasible practical logic. The fundamental intuition is that improbable co-occurrences can be “explained away” with reference to a common cause, or confounder. Semantic constraints in literature-based discovery can be leveraged to identify such covariates. Further, the asymmetric semantic constraints of causal propositions map directly to the topology of causal graphs as directed edges. The hypothesis is that causal models conditioned on sets of such covariates will improve upon the performance of purely statistical techniques for detecting adverse drug event signals. By improving upon previous work in purely EHR-based pharmacovigilance, these results establish the utility of this scalable approach to automated causal inference

DigitalCommons@The Texas Medical Center

Bayesian networks for disease diagnosis: What are they, who has used them and how?

Author: Barber Xavier
Muñoz-Valencia Carlos Segundo
Orozco Domingo
Quesada José Antonio
Publication venue
Publication date: 13/04/2023
Field of study

A Bayesian network (BN) is a probabilistic graph based on Bayes' theorem, used to show dependencies or cause-and-effect relationships between variables. They are widely applied in diagnostic processes since they allow the incorporation of medical knowledge to the model while expressing uncertainty in terms of probability. This systematic review presents the state of the art in the applications of BNs in medicine in general and in the diagnosis and prognosis of diseases in particular. Indexed articles from the last 40 years were included. The studies generally used the typical measures of diagnostic and prognostic accuracy: sensitivity, specificity, accuracy, precision, and the area under the ROC curve. Overall, we found that disease diagnosis and prognosis based on BNs can be successfully used to model complex medical problems that require reasoning under conditions of uncertainty.Comment: 22 pages, 5 figures, 1 table, Student PhD first pape

arXiv.org e-Print Archive

Bayesian Network model for students’ laboratory work performance assessment: an empirical investigation of the optimal construction approach

Author: Achumba Ifeyinwa
Azzi Djamel
Khusainov Rinat
Publication venue
Publication date: 01/07/2011
Field of study

Portsmouth University Research Portal (Pure)

“Do Not Kill Guinea Pig before Setting up Apparatus”: The Kymograph's Lost Educational Context

Author: Kwan Alistair Marcus
Publication venue
Publication date: 01/01/2016
Field of study

The objects of science education are transformed, degraded and disappeared for many reasons, and sometimes take other things with them when they go. This close reading of an undergraduate physiology laboratory report demonstrates how the kymograph was never a stand-alone instrument, but intertwined with conceptual frameworks and technical skills, laboratory amenities, materials, animal supply, technicians. Replacing the obsolete kymograph entails changing all of that, though our usual stories are focussed on progress associated with better measurements with fewer complications, not complications themselves. Such interconnectedness between progress and demise raises uncomfortable challenges for laboratory pedagogy, and for museum practice: what is laboratory education really about, and what kinds of heritage should museums, libraries and archives preserve to document it

PhilPapers

Humanities Commons

Synthetic Observational Health Data with GANs: from slow adoption to a boom in medical research and ultimately digital twins?

Author: Cirillo Elisa
Georges-Filteau Jeremy
Publication venue: 'Authorea, Inc.'
Publication date: 19/11/2020
Field of study

After being collected for patient care, Observational Health Data (OHD) can further benefit patient well-being by sustaining the development of health informatics and medical research. Vast potential is unexploited because of the fiercely private nature of patient-related data and regulations to protect it. Generative Adversarial Networks (GANs) have recently emerged as a groundbreaking way to learn generative models that produce realistic synthetic data. They have revolutionized practices in multiple domains such as self-driving cars, fraud detection, digital twin simulations in industrial sectors, and medical imaging. The digital twin concept could readily apply to modelling and quantifying disease progression. In addition, GANs posses many capabilities relevant to common problems in healthcare: lack of data, class imbalance, rare diseases, and preserving privacy. Unlocking open access to privacy-preserving OHD could be transformative for scientific research. In the midst of COVID-19, the healthcare system is facing unprecedented challenges, many of which of are data related for the reasons stated above. Considering these facts, publications concerning GAN applied to OHD seemed to be severely lacking. To uncover the reasons for this slow adoption, we broadly reviewed the published literature on the subject. Our findings show that the properties of OHD were initially challenging for the existing GAN algorithms (unlike medical imaging, for which state-of-the-art model were directly transferable) and the evaluation synthetic data lacked clear metrics. We find more publications on the subject than expected, starting slowly in 2017, and since then at an increasing rate. The difficulties of OHD remain, and we discuss issues relating to evaluation, consistency, benchmarking, data modelling, and reproducibility.Comment: 31 pages (10 in previous version), not including references and glossary, 51 in total. Inclusion of a large number of recent publications and expansion of the discussion accordingl

arXiv.org e-Print Archive

Usage Bibliometrics

Author: Abt
Accomazzi
Aggarwal
Baldi
Bar-Ilan
Bensman
Bertot
Blecic
Bollen
Bollen
Bollen
Bollen
Bollen
Bollen
Bollen
Bonitz
Borgman
Boyack
Boyack
Brin
Broadus
Broadus
Brody
Brody
Brookes
Burton
Börner
Börner
Castellano
Chen
Cooper
Craig
Cronin
Cronin
Cronin
Darmoni
Davis
Davis
Davis
Davis
Davis
Drott
Duy
Eason
Egghe
Eichhorn
Eysenbach
Eysenbach
Fortunato
Freire
Galvin
Gardner
Garfield
Garfield
Garfield
Gargouri
Georgakopoulos
Ginsparg
Ginsparg
Ginsparg
Ginsparg
Goldberg
Gosnell
Grant
Gross
Hajjem
Harnad
Harnad
Harnad
He
Henneken
Henneken
Henneken
Hider
Hood
Huntington
Jamali
Jansen
Jansen
Kaplan
King
King
King
King
Kurtz
Kurtz
Kurtz
Kurtz
Kurtz
Kurtz
Kurtz
Kurtz
Kurtz
Kurtz
Kurtz
Ladwig
Lawrence
Leydesdorff
Leydesdorff
Leydesdorff
Line
Line
Liu
Ludascher
Luther
MacRoberts
May
Mayr
McDonald
Meadows
Merton
Moed
Moed
Moed
Moya-Anegón
Nicholas
Norris
Pan
Parker
Peters
Pinski
Pirolli
Price
Price
Price
Rice
Rosvall
Rowlands
Rowlands
Scales
Shepherd
Small
Stankus
Szalay
Szalay
Tenopir
Tenopir
Tonta
Trimble
Trimble
Tsay
Tsay
Van de Sompel
Van de Sompel
Van de Sompel
Walter
Wang
Wasserman
White
Wilson
York
Publication venue: 'Wiley'
Publication date: 14/02/2011
Field of study

Scholarly usage data provides unique opportunities to address the known shortcomings of citation analysis. However, the collection, processing and analysis of usage data remains an area of active research. This article provides a review of the state-of-the-art in usage-based informetric, i.e. the use of usage data to study the scholarly process.Comment: Publisher's PDF (by permission). Publisher web site: books.infotoday.com/asist/arist44.shtm

arXiv.org e-Print Archive

Crossref