99 research outputs found

    Data-driven approach for creating synthetic electronic medical records

    Background: New algorithms for disease outbreak detection are being developed to take advantage of full electronic medical records (EMRs) that contain a wealth of patient information. However, due to privacy concerns, even anonymized EMRs cannot be shared among researchers, resulting in great difficulty in comparing the effectiveness of these algorithms. To bridge the gap between novel bio-surveillance algorithms operating on full EMRs and the lack of non-identifiable EMR data, a method for generating complete and synthetic EMRs was developed. Methods: This paper describes a novel methodology for generating complete synthetic EMRs both for an outbreak illness of interest (tularemia) and for background records. The method developed has three major steps: 1) synthetic patient identity and basic information generation; 2) identification of care patterns that the synthetic patients would receive based on the information present in real EMR data for similar health problems; 3) adaptation of these care patterns to the synthetic patient population. Results: We generated EMRs, including visit records, clinical activity, laboratory orders/results and radiology orders/results for 203 synthetic tularemia outbreak patients. Validation of the records by a medical expert revealed problems in 19% of the records; these were subsequently corrected. We also generated background EMRs for over 3000 patients in the 4-11 yr age group. Validation of those records by a medical expert revealed problems in fewer than 3% of these background patient EMRs and the errors were subsequently rectified. Conclusions: A data-driven method was developed for generating fully synthetic EMRs. The method is general and can be applied to any data set that has similar data elements (such as laboratory and radiology orders and results, clinical activity, prescription orders). The pilot synthetic outbreak records were for tularemia but our approach may be adapted to other infectious diseases. The pilot synthetic background records were in the 4-11 year old age group. The adaptations that must be made to the algorithms to produce synthetic background EMRs for other age groups are indicated.

    A Holistic Landscape Description Reveals That Landscape Configuration Changes More over Time than Composition: Implications for Landscape Ecology Studies

    Background: Space-for-time substitution (the assumption that spatial variations of a system can explain and predict the effect of temporal variations) is widely used in ecology. However, it is questionable whether it can validly be used to explain changes in biodiversity over time in response to land-cover changes. Hypothesis: Here, we hypothesize that different temporal vs spatial trajectories of landscape composition and configuration may limit space-for-time substitution in landscape ecology. Land-cover conversion changes not just the surface areas given over to particular types of land cover, but also affects isolation, patch size and heterogeneity. This means that a small change in land cover over time may have only minor repercussions on landscape composition but potentially major consequences for landscape configuration. Methods: Using land-cover maps of the Paris region for 1982 and 2003, we made a holistic description of the landscape disentangling landscape composition from configuration. After controlling for spatial variations, we analyzed and compared the amplitudes of changes in landscape composition and configuration over time. Results: For comparable spatial variations, landscape configuration varied more than twice as much as composition over time. Temporal changes in composition and configuration were not always spatially matched. Significance: The fact that landscape composition and configuration do not vary equally in space and time calls into question the use of space-for-time substitution in landscape ecology studies. The instability of landscapes over time appears to be attributable mainly to configurational changes. This may go some way to explaining why the landscape variables that account for changes over time in biodiversity are not the same ones that account for the spatial distribution of biodiversity.

    Association between Acquired Uniparental Disomy and Homozygous Mutations and HER2/ER/PR Status in Breast Cancer

    Background: Genetic alterations in cellular signaling networks are a hallmark of cancer; however, effective methods to discover them are lacking. A novel form of abnormality called acquired uniparental disomy (aUPD) was recently found to pinpoint the region of mutated genes in various cancers, thereby identifying the region for next-generation sequencing. Methods/Principal Findings: We retrieved large genomic data sets from the Gene Expression Omnibus database to perform genome-wide analysis of aUPD in breast tumor samples and cell lines using approaches that can reliably detect aUPD. aUPD was identified in 52.29% of the tumor samples. The most frequent aUPD regions were located at chromosomes 2q, 3p, 5q, 9p, 9q, 10q, 11q, 13q, 14q and 17q. We evaluated the data for any correlation between the most frequent aUPD regions and HER2/neu, ER, and PR status, and found a statistically significant correlation between the recurrent regions of aUPD and triple negative (TN) breast cancers. aUPD at chromosomes 17q (VEZF1, WNT3), 3p (SUMF1, GRM7), 9p (MTAP, NFIB) and 11q (CASP1, CASP4, CASP5) are predictors for TN. The frequency of aUPD was found to be significantly higher in TN breast cancer cases compared to HER2/neu-positive and/or ER or PR-positive cases. Furthermore, using previously published mutation data, we found TP53 homozygously mutated in cell lines having aUPD in that locus. Conclusions/Significance: We conclude that aUPD is a common and non-random molecular feature of breast cancer that is most prominent in triple negative cases. As aUPD regions are different among the main pathological subtypes, specific aUPD regions may aid the sub-classification of breast cancer. In addition, we provide statistical support using TP53 as an example that identifying aUPD regions can be an effective approach in finding aberrant genes. We thus conclu

    Rapid Internalization of the Oncogenic K+ Channel KV10.1

    KV10.1 is a mammalian brain voltage-gated potassium channel whose ectopic expression outside of the brain has been proven relevant for tumor biology. Promotion of cancer cell proliferation by KV10.1 depends largely on ion flow, but some oncogenic properties remain in the absence of ion permeation. Additionally, KV10.1 surface populations are small compared to large intracellular pools. Control of protein turnover within cells is key to both cellular plasticity and homeostasis, and therefore we set out to analyze how endocytic trafficking participates in controlling KV10.1 intracellular distribution and life cycle. To follow plasma membrane KV10.1 selectively, we generated a modified channel displaying an extracellular affinity tag for surface labeling by α-bungarotoxin. This modification only minimally affected KV10.1 electrophysiological properties. Using a combination of microscopy and biochemistry techniques, we show that KV10.1 is constitutively internalized through at least two distinct pathways of endocytosis and mainly sorted to lysosomes. This occurs at a relatively fast rate. Simultaneously, recycling seems to contribute to maintaining basal KV10.1 surface levels. The brief KV10.1 surface half-life and rapid lysosomal targeting are relevant factors to be taken into account for potential drug delivery and targeting strategies directed against KV10.1 on tumor cells.

    Treatment of Rat Spinal Cord Injury with the Neurotrophic Factor Albumin-Oleic Acid: Translational Application for Paralysis, Spasticity and Pain

    Sensorimotor dysfunction following incomplete spinal cord injury (iSCI) is often characterized by the debilitating symptoms of paralysis, spasticity and pain, which require treatment with novel pleiotropic pharmacological agents. Previous in vitro studies suggest that Albumin (Alb) and Oleic Acid (OA) may play a role together as an endogenous neurotrophic factor. Although Alb can promote basic recovery of motor function after iSCI, the therapeutic effect of OA or Alb-OA on a known translational measure of SCI associated with symptoms of spasticity and change in nociception has not been studied. Following T9 spinal contusion injury in Wistar rats, intrathecal treatments with i) saline, ii) Alb (0.4 nanomoles), iii) OA (80 nanomoles), iv) Alb-Elaidic acid (0.4/80 nanomoles), or v) Alb-OA (0.4/80 nanomoles) were evaluated for their effects on basic motor function and temporal summation of noxious reflex activity, and with a new test of descending modulation of spinal activity below the SCI, for up to one month after injury. Albumin, OA and Alb-OA treatment inhibited nociceptive Tibialis Anterior (TA) reflex activity. Moreover, Alb-OA synergistically promoted early recovery of locomotor activity to 50±10% of control and promoted de novo phasic descending inhibition of TA noxious reflex activity to 47±5% following non-invasive electrical conditioning stimulation applied above the iSCI. Spinal L4–L5 immunohistochemistry demonstrated a unique increase in serotonin fibre innervation of up to 4.2±1.1 and 2.3±0.3 fold within the dorsal and ventral horn respectively with Alb-OA treatment when compared to uninjured tissue, in addition to a reduction in NR1 NMDA receptor phosphorylation and microglia reactivity. Early recovery of voluntary motor function accompanied by tonic and de novo phasic descending inhibition of nociceptive TA flexor reflex activity following Alb-OA treatment, mediated via known endogenous spinal mechanisms of action, suggests a clinical application of this novel neurotrophic factor for the treatment of paralysis, spasticity and pain.

    Large expert-curated database for benchmarking document similarity detection in biomedical literature search

    Document recommendation systems for locating relevant literature have mostly relied on methods developed a decade ago. This is largely due to the lack of a large offline gold-standard benchmark of relevant documents that cover a variety of research fields such that newly developed literature search techniques can be compared, improved and translated into practice. To overcome this bottleneck, we have established the RElevant LIterature SearcH consortium consisting of more than 1500 scientists from 84 countries, who have collectively annotated the relevance of over 180 000 PubMed-listed articles with regard to their respective seed (input) article/s. The majority of annotations were contributed by highly experienced, original authors of the seed articles. The collected data cover 76% of all unique PubMed Medical Subject Headings descriptors. No systematic biases were observed across different experience levels, research fields or time spent on annotations. More importantly, annotations of the same document pairs contributed by different scientists were highly concordant. We further show that the three representative baseline methods used to generate recommended articles for evaluation (Okapi Best Matching 25, Term Frequency–Inverse Document Frequency and PubMed Related Articles) had similar overall performances. Additionally, we found that these methods each tend to produce distinct collections of recommended articles, suggesting that a hybrid method may be required to completely capture all relevant articles. The established database server located at https://relishdb.ict.griffith.edu.au is freely available for the downloading of annotation data and the blind testing of new methods. We expect that this benchmark will be useful for stimulating the development of new powerful techniques for title and title/abstract-based search engines for relevant articles in biomedical research
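
    As a rough illustration of one of the baseline families named in the abstract above (Term Frequency–Inverse Document Frequency), the following minimal sketch ranks candidate texts against a seed text by cosine similarity of TF-IDF vectors. It is not the consortium's implementation: the whitespace tokenizer, the smoothed IDF formula, and the function names `tfidf_vectors`, `cosine` and `rank_candidates` are all assumptions made for this example.

```python
import math
from collections import Counter


def tokenize(text):
    """Lowercase and split on whitespace (a deliberately naive tokenizer)."""
    return text.lower().split()


def tfidf_vectors(docs):
    """Return one {term: weight} dict per document.

    Weight = raw term count * smoothed IDF, where
    IDF = ln((1 + N) / (1 + df)) + 1 (an assumed, commonly used variant).
    """
    tokenized = [tokenize(d) for d in docs]
    n_docs = len(tokenized)
    # Document frequency: in how many documents each term occurs.
    doc_freq = Counter(term for tokens in tokenized for term in set(tokens))
    idf = {t: math.log((1 + n_docs) / (1 + df)) + 1.0 for t, df in doc_freq.items()}
    return [{t: c * idf[t] for t, c in Counter(tokens).items()} for tokens in tokenized]


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def rank_candidates(seed, candidates):
    """Return candidate indices ordered from most to least similar to the seed."""
    vectors = tfidf_vectors([seed] + candidates)
    seed_vec, cand_vecs = vectors[0], vectors[1:]
    scored = [(cosine(seed_vec, vec), i) for i, vec in enumerate(cand_vecs)]
    # Sort by descending similarity; ties keep the original candidate order.
    return [i for _, i in sorted(scored, key=lambda pair: (-pair[0], pair[1]))]
```

    Calling `rank_candidates(seed_abstract, candidate_abstracts)` returns indices into the candidate list. A benchmark of annotated seed/candidate pairs, such as the one the abstract describes, is what would let a ranking like this be compared against other methods.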

    Impact of clinical phenotypes on management and outcomes in European atrial fibrillation patients: a report from the ESC-EHRA EURObservational Research Programme in AF (EORP-AF) General Long-Term Registry

    Background: Epidemiological studies in atrial fibrillation (AF) illustrate that clinical complexity increases the risk of major adverse outcomes. We aimed to describe European AF patients’ clinical phenotypes and analyse the differential clinical course. Methods: We performed a hierarchical cluster analysis based on Ward’s Method and Squared Euclidean Distance using 22 clinical binary variables, identifying the optimal number of clusters. We investigated differences in clinical management, use of healthcare resources and outcomes in a cohort of European AF patients from a Europe-wide observational registry. Results: A total of 9363 patients were available for this analysis. We identified three clusters: Cluster 1 (n = 3634; 38.8%) characterized by older patients and prevalent non-cardiac comorbidities; Cluster 2 (n = 2774; 29.6%) characterized by younger patients with low prevalence of comorbidities; Cluster 3 (n = 2955; 31.6%) characterized by patients’ prevalent cardiovascular risk factors/comorbidities. Over a mean follow-up of 22.5 months, Cluster 3 had the highest rate of cardiovascular events, all-cause death, and the composite outcome (combining the previous two) compared to Cluster 1 and Cluster 2 (all P < .001). An adjusted Cox regression showed that compared to Cluster 2, Cluster 3 (hazard ratio (HR) 2.87, 95% confidence interval (CI) 2.27–3.62; HR 3.42, 95% CI 2.72–4.31; HR 2.79, 95% CI 2.32–3.35), and Cluster 1 (HR 1.88, 95% CI 1.48–2.38; HR 2.50, 95% CI 1.98–3.15; HR 2.09, 95% CI 1.74–2.51) reported a higher risk for the three outcomes respectively. Conclusions: In European AF patients, three main clusters were identified, distinguished by the differing presence of comorbidities. Both non-cardiac and cardiac comorbidities clusters were found to be associated with an increased risk of major adverse outcomes.

    Processing of joint molecule intermediates by structure-selective endonucleases during homologous recombination in eukaryotes

    Homologous recombination is required for maintaining genomic integrity by functioning in high-fidelity repair of DNA double-strand breaks and other complex lesions, replication fork support, and meiotic chromosome segregation. Joint DNA molecules are key intermediates in recombination and their differential processing determines whether the genetic outcome is a crossover or non-crossover event. The Holliday model of recombination highlights the resolution of four-way DNA joint molecules, termed Holliday junctions, and the bacterial Holliday junction resolvase RuvC set the paradigm for the mechanism of crossover formation. In eukaryotes, much effort has been invested in identifying the eukaryotic equivalent of bacterial RuvC, leading to the discovery of a number of DNA endonucleases, including Mus81–Mms4/EME1, Slx1–Slx4/BTBD12/MUS312, XPF–ERCC1, and Yen1/GEN1. These nucleases exert different selectivity for various DNA joint molecules, including Holliday junctions. Their mutant phenotypes and distinct species-specific characteristics expose a surprisingly complex system of joint molecule processing. In an attempt to reconcile the biochemical and genetic data, we propose that nicked junctions constitute important in vivo recombination intermediates whose processing determines the efficiency and outcome (crossover/non-crossover) of homologous recombination