28,793 research outputs found
Implementing a Portable Clinical NLP System with a Common Data Model - a Lisp Perspective
This paper presents a Lisp architecture for a portable NLP system, termed
LAPNLP, for processing clinical notes. LAPNLP integrates multiple standard,
customized and in-house developed NLP tools. Our system facilitates portability
across different institutions and data systems by incorporating an enriched
Common Data Model (CDM) to standardize necessary data elements. It utilizes
UMLS to perform domain adaptation when integrating generic domain NLP tools. It
also features stand-off annotations that are specified by positional reference
to the original document. We built an interval tree based search engine to
efficiently query and retrieve the stand-off annotations by specifying
positional requirements. We also developed a utility to convert an inline
annotation format to stand-off annotations to enable the reuse of clinical text
datasets with inline annotations. We experimented with our system on several
NLP facilitated tasks including computational phenotyping for lymphoma patients
and semantic relation extraction for clinical notes. These experiments
showcased the broader applicability and utility of LAPNLP.Comment: 6 pages, accepted by IEEE BIBM 2018 as regular pape
Recommended from our members
Nutritional and Post-Transplantation Outcomes of Enteral versus Parenteral Nutrition in Pediatric Hematopoietic Stem Cell Transplantation: A Systematic Review of Randomized and Nonrandomized Studies
Hematopoietic stem cell transplantation (HSCT) involves the administration of chemotherapy followed by the infusion of donor stem cells. After treatment, children can consequently experience nausea, vomiting, diarrhea, anorexia, and mucositis, which negatively impact oral intake, leading to rapid deterioration in nutritional status and risk of malnutrition. Nutrition support therefore becomes necessary to circumvent these adverse effects. This has traditionally been provided via parenteral nutrition (PN), but pediatric evidence is increasingly advocating enteral nutrition (EN) as a preferential alternative. The objective of this review is to determine the efficacy of any forms of EN versus PN provided during admission to children aged ≤ 18 years undergoing HSCT. Primary outcomes considered efficacy in relation to various nutritional parameters, and secondary outcomes included a range of post-transplantation parameters. Data sources included English and non-English articles from the start date of MEDLINE, EMBASE, AMED, CINAHL and Cochrane Controlled Trials register, up to July 2018. Key journals were also hand searched, reference lists scanned, clinical experts contacted, and gray literature searched using EThOS and Open Grey. Randomized and observational studies comparing any forms of EN versus PN in children aged ≤ 18 years undergoing HSCT investigating nutritional or post-transplantation outcomes were eligible. Data were extracted from included studies using a custom extraction form that had previously been piloted. Because included studies were observational, risk of bias was assessed using Risk of Bias in Non-randomised Studies of Interventions. Because only a small number of heterogenous studies reporting a wide range of differently defined outcomes were included, meta-analyses were not performed and data were presented in narrative form. Conflicting results in favor of either method of nutrition support or no difference between methods were seen for duration of interventions, nutritional intakes, biochemical and anthropometric changes, mortality, infections, length of admission, and neutrophil engraftment. EN may provide favorable benefits over PN regarding acute graft-versus-host-disease (aGVHD) and platelet engraftment. A paucity of studies was found investigating the question posed by this review. Included studies were clinically heterogenous regarding populations, interventions, and outcomes, at moderate to serious risk of bias due to the absence of randomization, confounding parameters, statistical control, retrospective designs, and participant selection. Some studies were more than 15 years old. Despite the limited number and poor quality of identified studies, results support the growing body of pediatric evidence that EN is feasible during HSCT. Similar differences regarding many nutritional and post-transplantation outcomes were seen in both forms of nutrition support, but EN could provide benefits above PN including reduced incidence of aGVHD and faster platelet engraftment
Advanced Knowledge Technologies at the Midterm: Tools and Methods for the Semantic Web
The University of Edinburgh and research sponsors are authorised to reproduce and distribute reprints and on-line copies for their purposes notwithstanding any copyright annotation hereon. The views and conclusions contained herein are the author’s and shouldn’t be interpreted as necessarily representing the official policies or endorsements, either expressed or implied, of other parties.In a celebrated essay on the new electronic media, Marshall McLuhan wrote in 1962:Our private senses are not closed systems but are endlessly translated into each other in that experience which we call consciousness. Our extended senses, tools, technologies, through the ages, have been closed systems incapable of interplay or collective awareness. Now, in the electric age, the very
instantaneous nature of co-existence among our technological instruments has created a crisis quite new in human history. Our extended faculties and senses now constitute a single field of experience which demands that they become collectively conscious. Our technologies, like our private senses, now demand an interplay and ratio that makes rational co-existence possible. As long as our technologies were as slow as the wheel or the alphabet or money, the fact that
they were separate, closed systems was socially and psychically supportable. This is not true now when sight and sound and movement are simultaneous and global in extent. (McLuhan 1962, p.5, emphasis in original)Over forty years later, the seamless interplay that McLuhan demanded between our
technologies is still barely visible. McLuhan’s predictions of the spread, and increased importance, of electronic media have of course been borne out, and the worlds of business, science and knowledge storage and transfer have been revolutionised. Yet
the integration of electronic systems as open systems remains in its infancy.Advanced Knowledge Technologies (AKT) aims to address this problem, to create a view of knowledge and its management across its lifecycle, to research and create the
services and technologies that such unification will require. Half way through its sixyear span, the results are beginning to come through, and this paper will explore some of the services, technologies and methodologies that have been developed. We hope to give a sense in this paper of the potential for the next three years, to discuss the insights and lessons learnt in the first phase of the project, to articulate the challenges and issues that remain.The WWW provided the original context that made the AKT approach to knowledge
management (KM) possible. AKT was initially proposed in 1999, it brought together an interdisciplinary consortium with the technological breadth and complementarity to create the conditions for a unified approach to knowledge across its lifecycle. The
combination of this expertise, and the time and space afforded the consortium by the
IRC structure, suggested the opportunity for a concerted effort to develop an approach
to advanced knowledge technologies, based on the WWW as a basic infrastructure.The technological context of AKT altered for the better in the short period between the development of the proposal and the beginning of the project itself with the development of the semantic web (SW), which foresaw much more intelligent manipulation and querying of knowledge. The opportunities that the SW provided for e.g., more intelligent retrieval, put AKT in the centre of information technology innovation and knowledge management services; the AKT skill set would clearly be central for the exploitation of those opportunities.The SW, as an extension of the WWW, provides an interesting set of constraints to
the knowledge management services AKT tries to provide. As a medium for the
semantically-informed coordination of information, it has suggested a number of ways in which the objectives of AKT can be achieved, most obviously through the
provision of knowledge management services delivered over the web as opposed to the creation and provision of technologies to manage knowledge.AKT is working on the assumption that many web services will be developed and provided for users. The KM problem in the near future will be one of deciding which services are needed and of coordinating them. Many of these services will be largely or entirely legacies of the WWW, and so the capabilities of the services will vary. As well as providing useful KM services in their own right, AKT will be aiming to exploit this opportunity, by reasoning over services, brokering between them, and providing essential meta-services for SW knowledge service management.Ontologies will be a crucial tool for the SW. The AKT consortium brings a lot of expertise on ontologies together, and ontologies were always going to be a key part of the strategy. All kinds of knowledge sharing and transfer activities will be mediated by ontologies, and ontology management will be an important enabling task. Different
applications will need to cope with inconsistent ontologies, or with the problems that will follow the automatic creation of ontologies (e.g. merging of pre-existing
ontologies to create a third). Ontology mapping, and the elimination of conflicts of
reference, will be important tasks. All of these issues are discussed along with our
proposed technologies.Similarly, specifications of tasks will be used for the deployment of knowledge services over the SW, but in general it cannot be expected that in the medium term there will be standards for task (or service) specifications. The brokering metaservices
that are envisaged will have to deal with this heterogeneity.The emerging picture of the SW is one of great opportunity but it will not be a wellordered, certain or consistent environment. It will comprise many repositories of legacy data, outdated and inconsistent stores, and requirements for common understandings across divergent formalisms. There is clearly a role for standards to play to bring much of this context together; AKT is playing a significant role in these efforts. But standards take time to emerge, they take political power to enforce, and they have been known to stifle innovation (in the short term). AKT is keen to understand the balance between principled inference and statistical processing of web content. Logical inference on the Web is tough. Complex queries using traditional AI inference methods bring most distributed computer systems to their knees. Do we set up semantically well-behaved areas of the Web? Is any part of the Web in which
semantic hygiene prevails interesting enough to reason in? These and many other
questions need to be addressed if we are to provide effective knowledge technologies
for our content on the web
Improving institutional memory on challenges and methods for estimation of pig herd antimicrobial exposure based on data from the Danish Veterinary Medicines Statistics Program (VetStat)
With the increasing occurrence of antimicrobial resistance, more attention
has been directed towards surveillance of both human and veterinary
antimicrobial use. Since the early 2000s, several research papers on Danish pig
antimicrobial usage have been published, based on data from the Danish
Veterinary Medicines Statistics Program (VetStat). VetStat was established in
2000, as a national database containing detailed information on purchases of
veterinary medicine. This paper presents a critical set of challenges
originating from static system features, which researchers must address when
estimating antimicrobial exposure in Danish pig herds. Most challenges
presented are followed by at least one robust solution. A set of challenges
requiring awareness from the researcher, but for which no immediate solution
was available, were also presented. The selection of challenges and solutions
was based on a consensus by a cross-institutional group of researchers working
in projects using VetStat data. No quantitative data quality evaluations were
performed, as the frequency of errors and inconsistencies in a dataset will
vary, depending on the period covered in the data. Instead, this paper focuses
on clarifying how VetStat data may be translated to an estimation of the
antimicrobial exposure at herd level, by suggesting uniform methods of
extracting and editing data, in order to obtain reliable and comparable
estimates on pig antimicrobial consumption for research purposes.Comment: 25 pages, including two Appendices (pages not numbered). Title page,
including abstract, is on page 1. Body of text, including references,
abbreviation list and disclaimers for conflict of interest and funding, are
on pages 2-18. Two figures embedded in the text on pages 3 and 5. Appendix 1
starts on page 19, and Appendix 2 on page 2
Automated Detection of Substance-Use Status and Related Information from Clinical Text
This study aims to develop and evaluate an automated system for extracting information related to patient substance use (smoking, alcohol, and drugs) from unstructured clinical text (medical discharge records). The authors propose a four-stage system for the extraction of the substance-use status and related attributes (type, frequency, amount, quit-time, and period). The first stage uses a keyword search technique to detect sentences related to substance use and to exclude unrelated records. In the second stage, an extension of the NegEx negation detection algorithm is developed and employed for detecting the negated records. The third stage involves identifying the temporal status of the substance use by applying windowing and chunking methodologies. Finally, in the fourth stage, regular expressions, syntactic patterns, and keyword search techniques are used in order to extract the substance-use attributes. The proposed system achieves an F1-score of up to 0.99 for identifying substance-use-related records, 0.98 for detecting the negation status, and 0.94 for identifying temporal status. Moreover, F1-scores of up to 0.98, 0.98, 1.00, 0.92, and 0.98 are achieved for the extraction of the amount, frequency, type, quit-time, and period attributes, respectively. Natural Language Processing (NLP) and rule-based techniques are employed efficiently for extracting substance-use status and attributes, with the proposed system being able to detect substance-use status and attributes over both sentence-level and document-level data. Results show that the proposed system outperforms the compared state-of-the-art substance-use identification system on an unseen dataset, demonstrating its generalisability
Knowledge-based Biomedical Data Science 2019
Knowledge-based biomedical data science (KBDS) involves the design and
implementation of computer systems that act as if they knew about biomedicine.
Such systems depend on formally represented knowledge in computer systems,
often in the form of knowledge graphs. Here we survey the progress in the last
year in systems that use formally represented knowledge to address data science
problems in both clinical and biological domains, as well as on approaches for
creating knowledge graphs. Major themes include the relationships between
knowledge graphs and machine learning, the use of natural language processing,
and the expansion of knowledge-based approaches to novel domains, such as
Chinese Traditional Medicine and biodiversity.Comment: Manuscript 43 pages with 3 tables; Supplemental material 43 pages
with 3 table
- …