    SEINE: Methods for Electronic Data Capture and Integrated Data Repository Synthesis with Patient Registry Use Cases

    Integrated Data Repositories (IDRs) allow clinical research to leverage electronic health records (EHRs) and other data sources, while Electronic Data Capture (EDC) applications often support manually maintained patient registries. Using i2b2 and REDCap (IDR and EDC platforms, respectively), we have developed methods that combine the strengths of both: 1) delivering data from the IDR as ready-to-use registries, to exploit the annotation and data collection capabilities unique to EDC applications; and 2) integrating EDC-managed registries back into the data repository, so that investigators can apply hypothesis generation and cohort discovery methods. This round-trip integration can lower the lag between cohort discovery and establishing a registry. Investigators can also periodically augment their registry cohort as the IDR is enriched with additional data elements, data sources, and patients. We describe our open-source automated methods and provide three example registry use cases: triple negative breast cancer, vertiginous syndrome, and cancer distress.
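    A minimal sketch of this kind of round trip is shown below, assuming an i2b2-style star schema reachable over SQL and a REDCap project with API access. The endpoint URL, token, field names, and helper functions are illustrative assumptions, not the SEINE implementation.

```python
# Minimal sketch (not the SEINE code): pull a cohort's facts from an i2b2-style
# star schema and push them to a REDCap project through the standard REDCap API.
# The connection, the patient set, and the REDCap field names are illustrative.
import json
import sqlite3      # stand-in for the site's actual i2b2 database driver
import requests

REDCAP_URL = "https://redcap.example.org/api/"   # hypothetical endpoint
REDCAP_TOKEN = "..."                              # project-level API token

def fetch_cohort_facts(conn, patient_set):
    """Read observations from observation_fact for the given cohort."""
    sql = """
        SELECT patient_num, concept_cd, start_date, nval_num
        FROM observation_fact
        WHERE patient_num IN ({placeholders})
    """.format(placeholders=",".join("?" * len(patient_set)))
    return conn.execute(sql, patient_set).fetchall()

def import_into_redcap(rows):
    """Import rows as REDCap records via the 'record' API content type."""
    records = [
        {"record_id": str(p), "concept": c, "obs_date": d, "value": v}
        for p, c, d, v in rows
    ]
    resp = requests.post(REDCAP_URL, data={
        "token": REDCAP_TOKEN,
        "content": "record",
        "format": "json",
        "type": "flat",
        "data": json.dumps(records),
    })
    resp.raise_for_status()
    return resp.json()

if __name__ == "__main__":
    conn = sqlite3.connect("i2b2_demo.db")        # placeholder data source
    facts = fetch_cohort_facts(conn, [1001, 1002, 1003])
    print(import_into_redcap(facts))
```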

    Implications of observation-fact modifiers to i2b2 ontologies

    Biomedical translational research can be facilitated by integrating clinical and research data. In particular, study cohort identification and hypothesis generation are enabled by the mining of integrated clinical observations and research resources. The Informatics for Integrating Biology and the Bedside, or i2b2, framework is widely used for this biomedical data mining. The i2b2 star schema data model, using entity-attribute-value (EAV) formatted concepts, is a very efficient strategy for querying large amounts of data. However, until the most recent i2b2 release, the utility of the platform was somewhat constrained by its limited ability to express facts about facts, i.e., to modify the observations about the patients. We have found that exploiting the new modifier functionality has significantly and favorably impacted the design of i2b2 ontologies, leading to easier and more meaningful query results. Copyright © 2011 IEEE
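    The snippet below is an illustrative query only, assuming the standard observation_fact layout; the concept and modifier codes are invented to show how a modifier (a "fact about a fact") distinguishes two otherwise identical observations.

```python
# Illustrative only: how an observation_fact modifier can be used in a query
# against an i2b2-style star schema. The concept and modifier codes below are
# invented for the example; real ontologies define their own coding schemes.
import sqlite3

conn = sqlite3.connect("i2b2_demo.db")   # placeholder i2b2-style database

# Without modifiers, an "ordered" and a "dispensed" medication would look like
# the same fact; modifier_cd makes them distinguishable at query time.
sql = """
    SELECT f.patient_num, f.concept_cd, f.modifier_cd, f.start_date
    FROM observation_fact f
    WHERE f.concept_cd = 'MED:ASPIRIN'          -- the base observation
      AND f.modifier_cd = 'MOD:DISPENSED'       -- the fact about that fact
"""
for row in conn.execute(sql):
    print(row)
```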

    An ICT infrastructure to integrate clinical and molecular data in oncology research

    Background: The ONCO-i2b2 platform is a bioinformatics tool designed to integrate clinical and research data and support translational research in oncology. It is implemented by the University of Pavia and the IRCCS Fondazione Maugeri hospital (FSM), and grounded on the software developed by the Informatics for Integrating Biology and the Bedside (i2b2) research center. i2b2 has delivered an open source suite based on a data warehouse, which is efficiently interrogated to find sets of interesting patients through a query tool interface. Methods: ONCO-i2b2 integrates data coming from multiple sources and allows users to jointly query them. i2b2 data are stored in a data warehouse, where facts are hierarchically structured as ontologies. ONCO-i2b2 gathers data from the FSM pathology unit (PU) database and from the hospital biobank and merges them with the clinical information from the hospital information system. Our main effort was to provide a robust integrated research environment, with particular emphasis on the integration process, which faced several challenges: biospecimen sample privacy and anonymization; synchronization of the biobank database with the i2b2 data warehouse through a series of Extract, Transform, Load (ETL) operations; and development and integration of a Natural Language Processing (NLP) module to retrieve coded information, such as SNOMED terms, malignant tumor (TNM) classifications, and clinical test results, from unstructured medical records. Furthermore, we have developed an internal SNOMED ontology based on the NCBO BioPortal web services. Results: ONCO-i2b2 manages data of more than 6,500 patients with a breast cancer diagnosis collected between 2001 and 2011 (over 390 of them have at least one biological sample in the cancer biobank), more than 47,000 visits, and 96,000 observations over 960 medical concepts. Conclusions: ONCO-i2b2 is a concrete example of how an integrated Information and Communication Technology architecture can be implemented to support translational research. The next steps of our project will involve extending its capabilities by implementing new plug-ins devoted to bioinformatics data analysis as well as a temporal query module.
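    As a rough illustration of the rule-based extraction such an NLP module might perform, the sketch below pulls TNM stage tokens out of free-text pathology reports; the regular expression and example text are assumptions, not the ONCO-i2b2 code.

```python
# Rough sketch of rule-based TNM extraction from free-text pathology reports,
# in the spirit of the NLP step described above (not the ONCO-i2b2 module).
# The regular expression covers common "pT2 pN1a M0"-style strings only.
import re

TNM_PATTERN = re.compile(
    r"\b[yp]?T(?P<T>[0-4Xis]+[a-c]?)\s*"
    r"[yp]?N(?P<N>[0-3X][a-c]?)\s*"
    r"[p]?M(?P<M>[01X])\b",
    re.IGNORECASE,
)

def extract_tnm(report_text):
    """Return a list of (T, N, M) tuples found in one report."""
    return [(m.group("T"), m.group("N"), m.group("M"))
            for m in TNM_PATTERN.finditer(report_text)]

example = "Invasive ductal carcinoma, grade 2. Staging: pT2 pN1a M0."
print(extract_tnm(example))   # [('2', '1a', '0')]
```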

    Clinical Bioinformatics: challenges and opportunities

    Background: Network Tools and Applications in Biology (NETTAB) Workshops are a series of meetings focused on the most promising and innovative ICT tools and on their usefulness in Bioinformatics. The NETTAB 2011 workshop, held in Pavia, Italy, in October 2011, was aimed at presenting some of the most relevant methods, tools and infrastructures that are nowadays available for Clinical Bioinformatics (CBI), the research field that deals with clinical applications of bioinformatics. Methods: In this editorial, we report the viewpoints and opinions of three world leaders in CBI, who were invited to participate in a panel discussion of the NETTAB workshop on the next challenges and future opportunities of this field. These include the development of data warehouses and ICT infrastructures for data sharing, the definition of standards for sharing phenotypic data, and the implementation of novel tools for efficient search computing solutions. Results: Some of the most important design features of a CBI-ICT infrastructure are presented, including data warehousing, modularity and flexibility, open-source development, semantic interoperability, and integrated search and retrieval of -omics information. Conclusions: The goals of Clinical Bioinformatics are ambitious. Many factors, including the availability of high-throughput “-omics” technologies and equipment, the widespread availability of clinical data warehouses, and the noteworthy increase in data storage and computational power of the most recent ICT systems, justify research and efforts in this domain, which promises to be a crucial leveraging factor for biomedical research.

    Collaborative Cloud Computing Framework for Health Data with Open Source Technologies

    The proliferation of sensor technologies and advancements in data collection methods have enabled the accumulation of very large amounts of data. Increasingly, these datasets are considered for scientific research. However, designing a system architecture that achieves high performance in terms of parallelization, query processing time, and aggregation of heterogeneous data types (e.g., time series, images, and structured data), while keeping scientific research reproducible, remains a major challenge. This is especially true for health sciences research, where systems must be i) easy to use, with the flexibility to manipulate data at the most granular level, ii) agnostic of the programming language kernel, iii) scalable, and iv) compliant with the HIPAA privacy law. In this paper, we review the existing literature on such big data systems for scientific research in health sciences and identify gaps in the current system landscape. We propose a novel architecture for a software-hardware-data ecosystem using open source technologies such as Apache Hadoop, Kubernetes, and JupyterHub in a distributed environment. We also evaluate the system using a large clinical data set of 69M patients. Comment: This paper is accepted in ACM-BCB 202
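    The sketch below illustrates the notebook-side workload such an ecosystem is meant to serve: a PySpark aggregation over a hypothetical de-identified clinical table stored on HDFS. The path and column names are invented for illustration and are not taken from the paper.

```python
# Sketch of a notebook-side analysis on the proposed Hadoop/Kubernetes/JupyterHub
# stack: a PySpark aggregation over a large clinical table on HDFS. The HDFS
# path and column names are invented for illustration.
from pyspark.sql import SparkSession
from pyspark.sql import functions as F

spark = (SparkSession.builder
         .appName("cohort-aggregation")
         .getOrCreate())

# Read a (hypothetical) de-identified encounters table written as Parquet.
encounters = spark.read.parquet("hdfs:///warehouse/encounters.parquet")

# Count distinct patients per diagnosis code, in parallel across the cluster.
summary = (encounters
           .groupBy("diagnosis_code")
           .agg(F.countDistinct("patient_id").alias("n_patients"))
           .orderBy(F.desc("n_patients")))

summary.show(20)
spark.stop()
```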