2,468 research outputs found

    Experimental Evaluation and Development of a Silver-Standard for the MIMIC-III Clinical Coding Dataset

    Get PDF
    Clinical coding is currently a labour-intensive, error-prone, but critical administrative process whereby hospital patient episodes are manually assigned codes by qualified staff from large, standardised taxonomic hierarchies of codes. Automating clinical coding has a long history in NLP research and has recently seen novel developments setting new state of the art results. A popular dataset used in this task is MIMIC-III, a large intensive care database that includes clinical free text notes and associated codes. We argue for the reconsideration of the validity MIMIC-III's assigned codes that are often treated as gold-standard, especially when MIMIC-III has not undergone secondary validation. This work presents an open-source, reproducible experimental methodology for assessing the validity of codes derived from EHR discharge summaries. We exemplify the methodology with MIMIC-III discharge summaries and show the most frequently assigned codes in MIMIC-III are under-coded up to 35%

    Hospital-wide natural language processing summarising the health data of 1 million patients

    Get PDF
    Electronic health records (EHRs) represent a major repository of real world clinical trajectories, interventions and outcomes. While modern enterprise EHR's try to capture data in structured standardised formats, a significant bulk of the available information captured in the EHR is still recorded only in unstructured text format and can only be transformed into structured codes by manual processes. Recently, Natural Language Processing (NLP) algorithms have reached a level of performance suitable for large scale and accurate information extraction from clinical text. Here we describe the application of open-source named-entity-recognition and linkage (NER+L) methods (CogStack, MedCAT) to the entire text content of a large UK hospital trust (King's College Hospital, London). The resulting dataset contains 157M SNOMED concepts generated from 9.5M documents for 1.07M patients over a period of 9 years. We present a summary of prevalence and disease onset as well as a patient embedding that captures major comorbidity patterns at scale. NLP has the potential to transform the health data lifecycle, through large-scale automation of a traditionally manual task

    Exploring the components, asymmetry and distribution of relationship quality in wild Barbary macaques (Macaca sylvanus)

    Get PDF
    Social relationships between group members are a key feature of many animal societies. The quality of social relationships has been described by three main components: value, compatibility and security, based on the benefits, tenure and stability of social exchanges. We aimed to analyse whether this three component structure could be used to describe the quality of social relationships in wild Barbary macaques (Macaca sylvanus). Moreover, we examined whether relationship quality was affected by the sex, age and rank differences between social partners, and investigated the asymmetric nature of social relationships. We collected over 1,900 hours of focal data on seven behavioural variables measuring relationship quality, and used principal component analysis to investigate how these variables clustered together. We found that relationship quality in wild Barbary macaques can be described by a three component structure that represents the value, compatibility and security of a relationship. Female-female dyads had more valuable relationships and same-age dyads more compatible relationships than any other dyad. Rank difference had no effect on the quality of a social relationship. Finally, we found a high degree of asymmetry in how members of a dyad exchange social behaviour. We argue that the asymmetry of social relationships should be taken into account when exploring the pattern and function of social behaviour in animal societies

    Grooming coercion and the post-conflict trading of social services in wild Barbary macaques

    Get PDF
    In animal and human societies, social services such as protection from predators are often exchanged between group members. The tactics that individuals display to obtain a service depend on its value and on differences between individuals in their capacity to aggressively obtain it. Here we analysed the exchange of valuable social services (i.e. grooming and relationship repair) in the aftermath of a conflict, in wild Barbary macaques (Macaca sylvanus). The relationship repair function of post-conflict affiliation (i.e. reconciliation) was apparent in the victim but not in the aggressor. Conversely, we found evidence for grooming coercion by the aggressor; when the victim failed to give grooming soon after a conflict they received renewed aggression from the aggressor. We argue that post-conflict affiliation between former opponents can be better described as a trading of social services rather than coercion alone, as both animals obtain some benefits (i.e. grooming for the aggressor and relationship repair for the victim). Our study is the first to test the importance of social coercion in the aftermath of a conflict. Differences in competitive abilities can affect the exchange of services and the occurrence of social coercion in animal societies. This may also help explain the variance between populations and species in their social behaviour and conflict management strategies

    DNA-induced spatial entrapment of general transcription machinery can stabilize gene expression in a nondividing cell.

    Get PDF
    Funder: Wellcome TrustAn important characteristic of cell differentiation is its stability. Only rarely do cells or their stem cell progenitors change their differentiation pathway. If they do, it is often accompanied by a malfunction such as cancer. A mechanistic understanding of the stability of differentiated states would allow better prospects of alleviating the malfunctioning. However, such complete information is yet elusive. Earlier experiments performed in Xenopus oocytes to address this question suggest that a cell may maintain its gene expression by prolonged binding of cell type-specific transcription factors. Here, using DNA competition experiments, we show that the stability of gene expression in a nondividing cell could be caused by the local entrapment of part of the general transcription machinery in transcriptionally active regions. Strikingly, we found that transcriptionally active and silent forms of the same DNA template can stably coexist within the same nucleus. Both DNA templates are associated with the gene-specific transcription factor Ascl1, the core factor TBP2, and the polymerase II (Pol-II) ser5 C-terminal domain (CTD) phosphorylated form, while Pol-II ser2 CTD phosphorylation is restricted to the transcriptionally dominant template. We discover that the active and silent DNA forms are physically separated in the oocyte nucleus through partition into liquid-liquid phase-separated condensates. Altogether, our study proposes a mechanism of transcriptional regulation involving a spatial entrapment of general transcription machinery components to stabilize the active form of a gene in a nondividing cell

    Spatial distribution of psychotic disorders in an urban area of France: an ecological study

    Get PDF
    Previous analyses of neighbourhood variations of non-affective psychotic disorders (NAPD) have focused mainly on incidence. However, prevalence studies provide important insights on factors associated with disease evolution as well as for healthcare resource allocation. This study aimed to investigate the distribution of prevalent NAPD cases in an urban area in France. The number of cases in each neighbourhood was modelled as a function of potential confounders and ecological variables, namely: migrant density, economic deprivation and social fragmentation. This was modelled using statistical models of increasing complexity: frequentist models (using Poisson and negative binomial regressions), and several Bayesian models. For each model, assumptions validity were checked and compared as to how this fitted to the data, in order to test for possible spatial variation in prevalence. Data showed significant overdispersion (invalidating the Poisson regression model) and residual autocorrelation (suggesting the need to use Bayesian models). The best Bayesian model was Leroux's model (i.e. a model with both strong correlation between neighbouring areas and weaker correlation between areas further apart), with economic deprivation as an explanatory variable (OR = 1.13, 95% CI [1.02-1.25]). In comparison with frequentist methods, the Bayesian model showed a better fit. The number of cases showed non-random spatial distribution and was linked to economic deprivation

    Primary immunodeficiency

    Get PDF
    Primary immunodeficiency disorder (PID) refers to a heterogeneous group of over 130 disorders that result from defects in immune system development and/or function. PIDs are broadly classified as disorders of adaptive immunity (i.e., T-cell, B-cell or combined immunodeficiencies) or of innate immunity (e.g., phagocyte and complement disorders). Although the clinical manifestations of PIDs are highly variable, most disorders involve at least an increased susceptibility to infection. Early diagnosis and treatment are imperative for preventing significant disease-associated morbidity and, therefore, consultation with a clinical immunologist is essential. PIDs should be suspected in patients with: recurrent sinus or ear infections or pneumonias within a 1 year period; failure to thrive; poor response to prolonged use of antibiotics; persistent thrush or skin abscesses; or a family history of PID. Patients with multiple autoimmune diseases should also be evaluated. Diagnostic testing often involves lymphocyte proliferation assays, flow cytometry, measurement of serum immunoglobulin (Ig) levels, assessment of serum specific antibody titers in response to vaccine antigens, neutrophil function assays, stimulation assays for cytokine responses, and complement studies. The treatment of PIDs is complex and generally requires both supportive and definitive strategies. Ig replacement therapy is the mainstay of therapy for B-cell disorders, and is also an important supportive treatment for many patients with combined immunodeficiency disorders. The heterogeneous group of disorders involving the T-cell arm of the adaptive system, such as severe combined immunodeficiency (SCID), require immune reconstitution as soon as possible. The treatment of innate immunodeficiency disorders varies depending on the type of defect, but may involve antifungal and antibiotic prophylaxis, cytokine replacement, vaccinations and bone marrow transplantation. This article provides a detailed overview of the major categories of PIDs and strategies for the appropriate diagnosis and management of these rare disorders

    Comparison of cellular responses to TGF-β1 and BMP-2 between healthy and torn tendons

    Get PDF
    Background: Tendons heal by fibrotic repair, increasing the likelihood of reinjury. Animal tendon injury and overuse models have identified transforming growth factor beta (TGF-β) and bone morphogenetic proteins (BMPs) as growth factors actively involved in the development of fibrosis, by mediating extracellular matrix synthesis and cell differentiation. Purpose: To understand how TGF-β and BMPs contribute to fibrotic processes using tendon-derived cells isolated from healthy and diseased human tendons. Study Design: Controlled laboratory study. Methods: Tendon-derived cells were isolated from patients with a chronic rotator cuff tendon tear (large to massive, diseased) and healthy hamstring tendons of patients undergoing anterior cruciate ligament repair. Isolated cells were incubated with TGF-β1 (10 ng/mL) or BMP-2 (100 ng/mL) for 3 days. Gene expression was measured by real-time quantitative polymerase chain reaction. Cell signaling pathway activation was determined by Western blotting. Results: TGF-β1 treatment induced ACAN mRNA expression in both cell types but less in the diseased compared with healthy cells (P < .05). BMP-2 treatment induced BGN mRNA expression in healthy but not diseased cells (P < .01). In the diseased cells, TGF-β1 treatment induced increased ACTA2 mRNA expression (P < .01) and increased small mothers against decapentaplegic (SMAD) signaling (P < .05) compared with those of healthy cells. Moreover, BMP-2 treatment induced ACTA2 mRNA expression in the diseased cells only (P < .05). Conclusion: Diseased tendon–derived cells show reduced expression of the proteoglycans aggrecan and biglycan in response to TGF-β1 and BMP-2 treatments. These same treatments induced enhanced fibrotic differentiation and canonical SMAD cell signaling in diseased compared with healthy cells. Clinical Relevance: Findings from this study suggest that diseased tendon–derived cells respond differently than healthy cells in the presence of TGF-β1 and BMP-2. The altered responses of diseased cells may influence fibrotic repair processes during tendon healing

    Three-Dimensional Spectral-Domain Optical Coherence Tomography Data Analysis for Glaucoma Detection

    Get PDF
    Purpose: To develop a new three-dimensional (3D) spectral-domain optical coherence tomography (SD-OCT) data analysis method using a machine learning technique based on variable-size super pixel segmentation that efficiently utilizes full 3D dataset to improve the discrimination between early glaucomatous and healthy eyes. Methods: 192 eyes of 96 subjects (44 healthy, 59 glaucoma suspect and 89 glaucomatous eyes) were scanned with SD-OCT. Each SD-OCT cube dataset was first converted into 2D feature map based on retinal nerve fiber layer (RNFL) segmentation and then divided into various number of super pixels. Unlike the conventional super pixel having a fixed number of points, this newly developed variable-size super pixel is defined as a cluster of homogeneous adjacent pixels with variable size, shape and number. Features of super pixel map were extracted and used as inputs to machine classifier (LogitBoost adaptive boosting) to automatically identify diseased eyes. For discriminating performance assessment, area under the curve (AUC) of the receiver operating characteristics of the machine classifier outputs were compared with the conventional circumpapillary RNFL (cpRNFL) thickness measurements. Results: The super pixel analysis showed statistically significantly higher AUC than the cpRNFL (0.855 vs. 0.707, respectively, p = 0.031, Jackknife test) when glaucoma suspects were discriminated from healthy, while no significant difference was found when confirmed glaucoma eyes were discriminated from healthy eyes. Conclusions: A novel 3D OCT analysis technique performed at least as well as the cpRNFL in glaucoma discrimination and even better at glaucoma suspect discrimination. This new method has the potential to improve early detection of glaucomatous damage. © 2013 Xu et al
    • …
    corecore