    Structuring the Unstructured: Unlocking pharmacokinetic data from journals with Natural Language Processing

    The development of a new drug is an increasingly expensive and inefficient process. Many drug candidates are discarded due to pharmacokinetic (PK) complications detected during clinical phases. It is critical to estimate the PK parameters of new drugs accurately before they are tested in humans, since these parameters determine efficacy and safety outcomes. Preclinical predictions of PK parameters are largely based on prior knowledge from other compounds, but much of this potentially valuable data is currently locked in the format of scientific papers. With an ever-increasing amount of scientific literature, automated systems are essential to exploit this resource efficiently, and text mining systems that can structure PK literature are critical to improving the drug development pipeline. This thesis studied the development and application of text mining resources to accelerate the curation of PK databases. Specifically, it addressed the development of novel corpora and suitable natural language processing architectures for the PK domain. The work focused on machine learning approaches that can model the high diversity of PK studies, parameter mentions, numerical measurements, units, and contextual information reported across the literature, and explored architectures and training approaches that can deal efficiently with the scarcity of annotated examples. The chapters of this thesis tackle the development of suitable models and corpora to (1) retrieve PK documents, (2) recognise PK parameter mentions, (3) link PK entities to a knowledge base, and (4) extract relations between parameter mentions, estimated measurements, units, and other contextual information. Finally, the last chapter studied the feasibility of the whole extraction pipeline for accelerating tasks in drug development research. The results demonstrated the potential of text mining approaches to automatically generate PK databases that can aid researchers in the field and ultimately accelerate the drug development pipeline. The thesis also contributed to biomedical natural language processing more broadly by developing suitable architectures and corpora for multiple tasks, tackling novel entities and relations within the PK domain.
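
    As a purely illustrative picture of what steps (2) and (4) involve, the sketch below spots PK parameter mentions and value-unit pairs with hand-written rules. The thesis itself develops machine-learning architectures; the parameter list, regular expressions, and example sentence here are invented for the sketch.

```python
import re

# Rule-based stand-in for PK mention and measurement spotting. The thesis
# trains ML models for this; the vocabulary and patterns below are invented.
PK_PARAM = re.compile(
    r"\b(clearance|half-life|AUC|Cmax|Tmax|volume of distribution|bioavailability)\b",
    re.IGNORECASE,
)
# Naive value+unit pattern; real reported measurements (ranges, CIs) are messier.
MEASUREMENT = re.compile(r"(\d+(?:\.\d+)?)\s*(mL/min|L/h|ng/mL|mg/L|h|%)")

def extract_pk_candidates(sentence: str):
    """Return candidate parameter mentions and (value, unit) pairs."""
    params = [m.group(0) for m in PK_PARAM.finditer(sentence)]
    measurements = [(m.group(1), m.group(2)) for m in MEASUREMENT.finditer(sentence)]
    return params, measurements

text = "Mean clearance was 12.3 mL/min and the terminal half-life was 4 h."
print(extract_pk_candidates(text))
# (['clearance', 'half-life'], [('12.3', 'mL/min'), ('4', 'h')])
```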

    ORAC-DR: A generic data reduction pipeline infrastructure

    ORAC-DR is a general-purpose data reduction pipeline system designed to be instrument and observatory agnostic. The pipeline works with instruments as varied as infrared integral field units, imaging arrays and spectrographs, and sub-millimeter heterodyne arrays and continuum cameras. This paper describes the architecture of the pipeline system and the implementation of the core infrastructure. We finish by discussing the lessons learned since the initial deployment of the pipeline system in the late 1990s. Comment: 11 pages, 1 figure; accepted for publication in Astronomy and Computing.

    Inviwo -- A Visualization System with Usage Abstraction Levels

    The complexity of today's visualization applications demands specific visualization systems tailored for the development of these applications. Frequently, such systems utilize levels of abstraction to improve the application development process, for instance by providing a data flow network editor. Unfortunately, these abstractions result in several issues, which need to be circumvented through an abstraction-centered system design. A high level of abstraction often hides low-level details, which makes it difficult to directly access the underlying computing platform, access that would be important for achieving optimal performance. We therefore propose a layer structure developed for modern and sustainable visualization systems that allows developers to interact with all contained abstraction levels. We refer to these interaction capabilities as usage abstraction levels, since we target application developers with various levels of experience. We formulate the requirements for such a system, derive the desired architecture, and present how the concepts have been realized within the Inviwo visualization system as an exemplar. Furthermore, we address several specific challenges that arise during the realization of such a layered architecture, such as communication between different computing platforms, performance-centered encapsulation, and layer-independent development supported by cross-layer documentation and debugging capabilities.

    A proposal for a coordinated effort for the determination of brainwide neuroanatomical connectivity in model organisms at a mesoscopic scale

    In this era of complete genomes, our knowledge of neuroanatomical circuitry remains surprisingly sparse. Such knowledge is, however, critical for both basic and clinical research into brain function. Here we advocate a concerted effort to fill this gap through systematic, experimental mapping of neural circuits at a mesoscopic scale of resolution suitable for comprehensive, brain-wide coverage, using injections of tracers or viral vectors. We detail the scientific and medical rationale and briefly review existing knowledge and experimental techniques. We define a set of desiderata, including brain-wide coverage; validated and extensible experimental techniques suitable for standardization and automation; a centralized, open-access data repository; compatibility with existing resources; and tractability with current informatics technology. We discuss a hypothetical but tractable plan for mouse, additional efforts for the macaque, and technique development for human. We estimate that the mouse connectivity project could be completed within five years with a comparatively modest budget. Comment: 41 pages.

    Evidence Inference 2.0: More Data, Better Models

    How do we most effectively treat a disease or condition? Ideally, we could consult a database of evidence gleaned from clinical trials to answer such questions. Unfortunately, no such database exists; clinical trial results are instead disseminated primarily via lengthy natural language articles. Perusing all such articles would be prohibitively time-consuming for healthcare practitioners; they instead tend to depend on manually compiled systematic reviews of the medical literature to inform care. NLP may speed this process up and eventually facilitate immediate consultation of published evidence. The Evidence Inference dataset was recently released to facilitate research toward this end. The task entails inferring the comparative performance of two treatments, with respect to a given outcome, from a particular article (describing a clinical trial) and identifying supporting evidence. For instance: does this article report that chemotherapy performed better than surgery for five-year survival rates of operable cancers? In this paper, we collect additional annotations to expand the Evidence Inference dataset by 25%, provide stronger baseline models, systematically inspect the errors that these make, and probe dataset quality. We also release an abstract-only (as opposed to full-text) version of the task for rapid model prototyping. The updated corpus, documentation, and code for the new baselines and evaluations are available at http://evidence-inference.ebm-nlp.com/. Comment: accepted as a workshop paper at BioNLP; results updated from SciBERT to Biomed RoBERTa.
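
    To make the task format concrete, the sketch below shows the shape of the input a model receives and the output it must produce. The class names, field names, and exact label strings are assumptions for illustration, not the dataset's released schema.

```python
from dataclasses import dataclass
from typing import Literal

# Hypothetical rendering of one Evidence Inference instance; the real
# dataset's field names and label strings may differ.
@dataclass
class Prompt:
    intervention: str  # e.g. "chemotherapy"
    comparator: str    # e.g. "surgery"
    outcome: str       # e.g. "five-year survival"

@dataclass
class Answer:
    label: Literal[
        "significantly increased",
        "significantly decreased",
        "no significant difference",
    ]
    evidence: str  # supporting span extracted from the trial report

prompt = Prompt("chemotherapy", "surgery", "five-year survival of operable cancers")
answer = Answer("significantly increased", "<supporting sentence from the article>")
```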