73 research outputs found

    Ensuring phenotyping algorithms using national electronic health records are FAIR:Meeting the needs of the cardiometabolic research community

    Get PDF
    Phenotyping algorithms enable the extraction of clinically-relevant information (such as diagnoses, prescription information, or a blood pressure measurement) from electronic health records for use in research. They have enormous potential and wide-ranging utility in research to improve disease understanding, health, and healthcare provision. While great progress has been achieved over the past years in standardising how genomic data are represented and curated (e.g. VCF files for variants), phenotypic data are significantly more fragmented and lack a common representation approach. This lack of standards creates challenges, including a lack of comparability, transparency and reproducibility, and limiting the subsequent use of phenotyping algorithms in other research studies. The FAIR guiding principles for scientific data management and stewardship state that digital assets should be findable, accessible, interoperable and reusable, yet the current lack of phenotyping algorithm standards means that phenotyping algorithms are not FAIR. We have therefore engaged with the community to address these challenges, including defining standards for the reporting and sharing of phenotyping algorithms. Here we present the results of our engagement with the community to identify and explore their requirements and outline our recommendations to ensure FAIR phenotyping algorithms are available to meet the needs of the cardiometabolic research community

    Dominant suppression of inflammation via targeted mutation of the mRNA destabilizing protein tristetraprolin

    Get PDF
    In myeloid cells, the mRNA-destabilizing protein tristetraprolin (TTP) is induced and extensively phosphorylated in response to LPS. To investigate the role of two specific phosphorylations, at serines 52 and 178, we created a mouse strain in which those residues were replaced by nonphosphorylatable alanine residues. The mutant form of TTP was constitutively degraded by the proteasome and therefore expressed at low levels, yet it functioned as a potent mRNA destabilizing factor and inhibitor of the expression of many inflammatory mediators. Mice expressing only the mutant form of TTP were healthy and fertile, and their systemic inflammatory responses to LPS were strongly attenuated. Adaptive immune responses and protection against infection by Salmonella typhimurium were spared. A single allele encoding the mutant form of TTP was sufficient for enhanced mRNA degradation and underexpression of inflammatory mediators. Therefore, the equilibrium between unphosphorylated and phosphorylated TTP is a critical determinant of the inflammatory response, and manipulation of this equilibrium may be a means of treating inflammatory pathologies

    Transient Storage as a Function of Geomorphology, Discharge, and Permafrost Active Layer Conditions in Arctic Tundra Streams

    Get PDF
    Transient storage of solutes in hyporheic zones or other slow-moving stream waters plays an important role in the biogeochemical processes of streams. While numerous studies have reported a wide range of parameter values from simulations of transient storage, little field work has been done to investigate the correlations between these parameters and shifts in surface and subsurface flow conditions. In this investigation we use the stream properties of the Arctic (namely, highly varied discharges, channel morphologies, and subchannel permafrost conditions) to isolate the effects of discharge, channel morphology, and potential size of the hyporheic zone on transient storage. We repeated stream tracer experiments in five morphologically diverse tundra streams in Arctic Alaska during the thaw season (May–August) of 2004 to assess transient storage and hydrologic characteristics. We compared transient storage model parameters to discharge (Q), the Darcy-Weisbach friction factor (f), and unit stream power (ω). Across all studied streams, permafrost active layer depths (i.e., the potential extent of the hyporheic zone) increased throughout the thaw season, and discharges and velocities varied dramatically with minimum ranges of eight-fold and four-fold, respectively. In all reaches the mean storage residence time (tstor) decreased exponentially with increasing Q, but did not clearly relate to permafrost active layer depths. Furthermore, we found that modeled transient storage metrics (i.e., tstor, storage zone exchange rate (αOTIS), and hydraulic retention (Rh)) correlated better with channel hydraulic descriptors such as f and ω than they did with Q or channel slope. Our results indicate that Q is the first-order control on transient storage dynamics of these streams, and that f and ω are two relatively simple measures of channel hydraulics that may be important metrics for predicting the response of transient storage to perturbations in discharge and morphology in a given stream

    A genetic variation map for chicken with 2.8 million single-nucleotide polymorphisms

    Get PDF
    We describe a genetic variation map for the chicken genome containing 2.8 million single-nucleotide polymorphisms ( SNPs). This map is based on a comparison of the sequences of three domestic chicken breeds ( a broiler, a layer and a Chinese silkie) with that of their wild ancestor, red jungle fowl. Subsequent experiments indicate that at least 90% of the variant sites are true SNPs, and at least 70% are common SNPs that segregate in many domestic breeds. Mean nucleotide diversity is about five SNPs per kilobase for almost every possible comparison between red jungle fowl and domestic lines, between two different domestic lines, and within domestic lines - in contrast to the notion that domestic animals are highly inbred relative to their wild ancestors. In fact, most of the SNPs originated before domestication, and there is little evidence of selective sweeps for adaptive alleles on length scales greater than 100 kilobases

    COVID-19 trajectories among 57 million adults in England: a cohort study using electronic health records

    Get PDF
    BACKGROUND: Updatable estimates of COVID-19 onset, progression, and trajectories underpin pandemic mitigation efforts. To identify and characterise disease trajectories, we aimed to define and validate ten COVID-19 phenotypes from nationwide linked electronic health records (EHR) using an extensible framework. METHODS: In this cohort study, we used eight linked National Health Service (NHS) datasets for people in England alive on Jan 23, 2020. Data on COVID-19 testing, vaccination, primary and secondary care records, and death registrations were collected until Nov 30, 2021. We defined ten COVID-19 phenotypes reflecting clinically relevant stages of disease severity and encompassing five categories: positive SARS-CoV-2 test, primary care diagnosis, hospital admission, ventilation modality (four phenotypes), and death (three phenotypes). We constructed patient trajectories illustrating transition frequency and duration between phenotypes. Analyses were stratified by pandemic waves and vaccination status. FINDINGS: Among 57 032 174 individuals included in the cohort, 13 990 423 COVID-19 events were identified in 7 244 925 individuals, equating to an infection rate of 12·7% during the study period. Of 7 244 925 individuals, 460 737 (6·4%) were admitted to hospital and 158 020 (2·2%) died. Of 460 737 individuals who were admitted to hospital, 48 847 (10·6%) were admitted to the intensive care unit (ICU), 69 090 (15·0%) received non-invasive ventilation, and 25 928 (5·6%) received invasive ventilation. Among 384 135 patients who were admitted to hospital but did not require ventilation, mortality was higher in wave 1 (23 485 [30·4%] of 77 202 patients) than wave 2 (44 220 [23·1%] of 191 528 patients), but remained unchanged for patients admitted to the ICU. Mortality was highest among patients who received ventilatory support outside of the ICU in wave 1 (2569 [50·7%] of 5063 patients). 15 486 (9·8%) of 158 020 COVID-19-related deaths occurred within 28 days of the first COVID-19 event without a COVID-19 diagnoses on the death certificate. 10 884 (6·9%) of 158 020 deaths were identified exclusively from mortality data with no previous COVID-19 phenotype recorded. We observed longer patient trajectories in wave 2 than wave 1. INTERPRETATION: Our analyses illustrate the wide spectrum of disease trajectories as shown by differences in incidence, survival, and clinical pathways. We have provided a modular analytical framework that can be used to monitor the impact of the pandemic and generate evidence of clinical and policy relevance using multiple EHR sources. FUNDING: British Heart Foundation Data Science Centre, led by Health Data Research UK

    Procalcitonin Is Not a Reliable Biomarker of Bacterial Coinfection in People With Coronavirus Disease 2019 Undergoing Microbiological Investigation at the Time of Hospital Admission

    Get PDF
    Abstract Admission procalcitonin measurements and microbiology results were available for 1040 hospitalized adults with coronavirus disease 2019 (from 48 902 included in the International Severe Acute Respiratory and Emerging Infections Consortium World Health Organization Clinical Characterisation Protocol UK study). Although procalcitonin was higher in bacterial coinfection, this was neither clinically significant (median [IQR], 0.33 [0.11–1.70] ng/mL vs 0.24 [0.10–0.90] ng/mL) nor diagnostically useful (area under the receiver operating characteristic curve, 0.56 [95% confidence interval, .51–.60]).</jats:p

    Implementation of corticosteroids in treating COVID-19 in the ISARIC WHO Clinical Characterisation Protocol UK:prospective observational cohort study

    Get PDF
    BACKGROUND: Dexamethasone was the first intervention proven to reduce mortality in patients with COVID-19 being treated in hospital. We aimed to evaluate the adoption of corticosteroids in the treatment of COVID-19 in the UK after the RECOVERY trial publication on June 16, 2020, and to identify discrepancies in care. METHODS: We did an audit of clinical implementation of corticosteroids in a prospective, observational, cohort study in 237 UK acute care hospitals between March 16, 2020, and April 14, 2021, restricted to patients aged 18 years or older with proven or high likelihood of COVID-19, who received supplementary oxygen. The primary outcome was administration of dexamethasone, prednisolone, hydrocortisone, or methylprednisolone. This study is registered with ISRCTN, ISRCTN66726260. FINDINGS: Between June 17, 2020, and April 14, 2021, 47 795 (75·2%) of 63 525 of patients on supplementary oxygen received corticosteroids, higher among patients requiring critical care than in those who received ward care (11 185 [86·6%] of 12 909 vs 36 415 [72·4%] of 50 278). Patients 50 years or older were significantly less likely to receive corticosteroids than those younger than 50 years (adjusted odds ratio 0·79 [95% CI 0·70–0·89], p=0·0001, for 70–79 years; 0·52 [0·46–0·58], p80 years), independent of patient demographics and illness severity. 84 (54·2%) of 155 pregnant women received corticosteroids. Rates of corticosteroid administration increased from 27·5% in the week before June 16, 2020, to 75–80% in January, 2021. INTERPRETATION: Implementation of corticosteroids into clinical practice in the UK for patients with COVID-19 has been successful, but not universal. Patients older than 70 years, independent of illness severity, chronic neurological disease, and dementia, were less likely to receive corticosteroids than those who were younger, as were pregnant women. This could reflect appropriate clinical decision making, but the possibility of inequitable access to life-saving care should be considered. FUNDING: UK National Institute for Health Research and UK Medical Research Council

    Large expert-curated database for benchmarking document similarity detection in biomedical literature search

    Get PDF
    Document recommendation systems for locating relevant literature have mostly relied on methods developed a decade ago. This is largely due to the lack of a large offline gold-standard benchmark of relevant documents that cover a variety of research fields such that newly developed literature search techniques can be compared, improved and translated into practice. To overcome this bottleneck, we have established the RElevant LIterature SearcH consortium consisting of more than 1500 scientists from 84 countries, who have collectively annotated the relevance of over 180 000 PubMed-listed articles with regard to their respective seed (input) article/s. The majority of annotations were contributed by highly experienced, original authors of the seed articles. The collected data cover 76% of all unique PubMed Medical Subject Headings descriptors. No systematic biases were observed across different experience levels, research fields or time spent on annotations. More importantly, annotations of the same document pairs contributed by different scientists were highly concordant. We further show that the three representative baseline methods used to generate recommended articles for evaluation (Okapi Best Matching 25, Term Frequency-Inverse Document Frequency and PubMed Related Articles) had similar overall performances. Additionally, we found that these methods each tend to produce distinct collections of recommended articles, suggesting that a hybrid method may be required to completely capture all relevant articles. The established database server located at https://relishdb.ict.griffith.edu.au is freely available for the downloading of annotation data and the blind testing of new methods. We expect that this benchmark will be useful for stimulating the development of new powerful techniques for title and title/abstract-based search engines for relevant articles in biomedical research.Peer reviewe

    The impact of viral mutations on recognition by SARS-CoV-2 specific T cells.

    Get PDF
    We identify amino acid variants within dominant SARS-CoV-2 T cell epitopes by interrogating global sequence data. Several variants within nucleocapsid and ORF3a epitopes have arisen independently in multiple lineages and result in loss of recognition by epitope-specific T cells assessed by IFN-γ and cytotoxic killing assays. Complete loss of T cell responsiveness was seen due to Q213K in the A∗01:01-restricted CD8+ ORF3a epitope FTSDYYQLY207-215; due to P13L, P13S, and P13T in the B∗27:05-restricted CD8+ nucleocapsid epitope QRNAPRITF9-17; and due to T362I and P365S in the A∗03:01/A∗11:01-restricted CD8+ nucleocapsid epitope KTFPPTEPK361-369. CD8+ T cell lines unable to recognize variant epitopes have diverse T cell receptor repertoires. These data demonstrate the potential for T cell evasion and highlight the need for ongoing surveillance for variants capable of escaping T cell as well as humoral immunity.This work is supported by the UK Medical Research Council (MRC); Chinese Academy of Medical Sciences(CAMS) Innovation Fund for Medical Sciences (CIFMS), China; National Institute for Health Research (NIHR)Oxford Biomedical Research Centre, and UK Researchand Innovation (UKRI)/NIHR through the UK Coro-navirus Immunology Consortium (UK-CIC). Sequencing of SARS-CoV-2 samples and collation of data wasundertaken by the COG-UK CONSORTIUM. COG-UK is supported by funding from the Medical ResearchCouncil (MRC) part of UK Research & Innovation (UKRI),the National Institute of Health Research (NIHR),and Genome Research Limited, operating as the Wellcome Sanger Institute. T.I.d.S. is supported by a Well-come Trust Intermediate Clinical Fellowship (110058/Z/15/Z). L.T. is supported by the Wellcome Trust(grant number 205228/Z/16/Z) and by theUniversity of Liverpool Centre for Excellence in Infectious DiseaseResearch (CEIDR). S.D. is funded by an NIHR GlobalResearch Professorship (NIHR300791). L.T. and S.C.M.are also supported by the U.S. Food and Drug Administration Medical Countermeasures Initiative contract75F40120C00085 and the National Institute for Health Research Health Protection Research Unit (HPRU) inEmerging and Zoonotic Infections (NIHR200907) at University of Liverpool inpartnership with Public HealthEngland (PHE), in collaboration with Liverpool School of Tropical Medicine and the University of Oxford.L.T. is based at the University of Liverpool. M.D.P. is funded by the NIHR Sheffield Biomedical ResearchCentre (BRC – IS-BRC-1215-20017). ISARIC4C is supported by the MRC (grant no MC_PC_19059). J.C.K.is a Wellcome Investigator (WT204969/Z/16/Z) and supported by NIHR Oxford Biomedical Research Centreand CIFMS. The views expressed are those of the authors and not necessarily those of the NIHR or MRC

    Whole-genome sequencing reveals host factors underlying critical COVID-19

    Get PDF
    Critical COVID-19 is caused by immune-mediated inflammatory lung injury. Host genetic variation influences the development of illness requiring critical care1 or hospitalization2–4 after infection with SARS-CoV-2. The GenOMICC (Genetics of Mortality in Critical Care) study enables the comparison of genomes from individuals who are critically ill with those of population controls to find underlying disease mechanisms. Here we use whole-genome sequencing in 7,491 critically ill individuals compared with 48,400 controls to discover and replicate 23 independent variants that significantly predispose to critical COVID-19. We identify 16 new independent associations, including variants within genes that are involved in interferon signalling (IL10RB and PLSCR1), leucocyte differentiation (BCL11A) and blood-type antigen secretor status (FUT2). Using transcriptome-wide association and colocalization to infer the effect of gene expression on disease severity, we find evidence that implicates multiple genes—including reduced expression of a membrane flippase (ATP11A), and increased expression of a mucin (MUC1)—in critical disease. Mendelian randomization provides evidence in support of causal roles for myeloid cell adhesion molecules (SELE, ICAM5 and CD209) and the coagulation factor F8, all of which are potentially druggable targets. Our results are broadly consistent with a multi-component model of COVID-19 pathophysiology, in which at least two distinct mechanisms can predispose to life-threatening disease: failure to control viral replication; or an enhanced tendency towards pulmonary inflammation and intravascular coagulation. We show that comparison between cases of critical illness and population controls is highly efficient for the detection of therapeutically relevant mechanisms of disease
    corecore