37 research outputs found
Identifying patients with undiagnosed small intestinal neuroendocrine tumours in primary care using statistical and machine learning: model development and validation study
Background: Neuroendocrine tumours (NETs) are increasing in incidence, often diagnosed at advanced stages, and individuals may experience years of diagnostic delay, particularly when arising from the small intestine (SI). Clinical prediction models could present novel opportunities for case finding in primary care. Methods: An open cohort of adults (18+ years) contributing data to the Optimum Patient Care Research Database between 1st Jan 2000 and 30th March 2023 was identified. This database collects de-identified data from general practices in the UK. Model development approaches comprised logistic regression, penalised regression, and XGBoost. Performance (discrimination and calibration) was assessed using internal-external cross-validation. Decision analysis curves compared clinical utility. Results: Of 11.7 million individuals, 382 had recorded SI NET diagnoses (0.003%). The XGBoost model had the highest AUC (0.869, 95% confidence interval [CI]: 0.841–0.898) but was mildly miscalibrated (slope 1.165, 95% CI: 1.088–1.243; calibration-in-the-large 0.010, 95% CI: −0.164 to 0.185). Clinical utility was similar across all models. Discussion: Multivariable prediction models may have clinical utility in identifying individuals with undiagnosed SI NETs using information in their primary care records. Further evaluation including external validation and health economics modelling may identify cost-effective strategies for case finding for this uncommon tumour
Widespread genomic influences on phenotype in Dravet syndrome, a ‘monogenic’ condition
Dravet syndrome is an archetypal rare severe epilepsy, considered “monogenic”, typically caused by loss-of-function SCN1A variants. Despite a recognisable core phenotype, its marked phenotypic heterogeneity is incompletely explained by differences in the causal SCN1A variant or clinical factors. In 34 adults with SCN1A-related Dravet syndrome, we show additional genomic variation beyond SCN1A contributes to phenotype and its diversity, with an excess of rare variants in epilepsy-related genes as a set and examples of blended phenotypes, including one individual with an ultra-rare DEPDC5 variant and focal cortical dysplasia. Polygenic risk scores for intelligence are lower, and for longevity, higher, in Dravet syndrome than in epilepsy controls. The causal, major-effect, SCN1A variant may need to act against a broadly compromised genomic background to generate the full Dravet syndrome phenotype, whilst genomic resilience may help to ameliorate the risk of premature mortality in adult Dravet syndrome survivors
Recommended from our members
Use of whole genome sequencing to determine genetic basis of suspected mitochondrial disorders: cohort study
Funder: University of Cambridge; FundRef: http://dx.doi.org/10.13039/501100000735Funder: Alzheimer's Society; FundRef: http://dx.doi.org/10.13039/501100000320Funder: Leverhulme Trust; FundRef: http://dx.doi.org/10.13039/501100000275Funder: National Institute for Health Research; FundRef: http://dx.doi.org/10.13039/501100000272Funder: Department of Health; FundRef: http://dx.doi.org/10.13039/501100000276Funder: Evelyn Trust; FundRef: http://dx.doi.org/10.13039/501100004282Funder: Wellcome Trust; FundRef: http://dx.doi.org/10.13039/100004440Funder: Medical Research Council; FundRef: http://dx.doi.org/10.13039/501100000265Abstract: Objective: To determine whether whole genome sequencing can be used to define the molecular basis of suspected mitochondrial disease. Design: Cohort study. Setting: National Health Service, England, including secondary and tertiary care. Participants: 345 patients with suspected mitochondrial disorders recruited to the 100 000 Genomes Project in England between 2015 and 2018. Intervention: Short read whole genome sequencing was performed. Nuclear variants were prioritised on the basis of gene panels chosen according to phenotypes, ClinVar pathogenic/likely pathogenic variants, and the top 10 prioritised variants from Exomiser. Mitochondrial DNA variants were called using an in-house pipeline and compared with a list of pathogenic variants. Copy number variants and short tandem repeats for 13 neurological disorders were also analysed. American College of Medical Genetics guidelines were followed for classification of variants. Main outcome measure: Definite or probable genetic diagnosis. Results: A definite or probable genetic diagnosis was identified in 98/319 (31%) families, with an additional 6 (2%) possible diagnoses. Fourteen of the diagnoses (4% of the 319 families) explained only part of the clinical features. A total of 95 different genes were implicated. Of 104 families given a diagnosis, 39 (38%) had a mitochondrial diagnosis and 65 (63%) had a non-mitochondrial diagnosis. Conclusion: Whole genome sequencing is a useful diagnostic test in patients with suspected mitochondrial disorders, yielding a diagnosis in a further 31% after exclusion of common causes. Most diagnoses were non-mitochondrial disorders and included developmental disorders with intellectual disability, epileptic encephalopathies, other metabolic disorders, cardiomyopathies, and leukodystrophies. These would have been missed if a targeted approach was taken, and some have specific treatments
Recommended from our members
Spectrum of mutational signatures in T-cell lymphoma reveals a key role for UV radiation in cutaneous T-cell lymphoma
Funder: Galderma; doi: http://dx.doi.org/10.13039/501100009754Funder: NIHR-BRC Cambridge core grantFunder: National Institute for Health Research; doi: http://dx.doi.org/10.13039/501100000272Funder: NHS EnglandAbstract: T-cell non-Hodgkin’s lymphomas develop following transformation of tissue resident T-cells. We performed a meta-analysis of whole exome sequencing data from 403 patients with eight subtypes of T-cell non-Hodgkin’s lymphoma to identify mutational signatures and associated recurrent gene mutations. Signature 1, indicative of age-related deamination, was prevalent across all T-cell lymphomas, reflecting the derivation of these malignancies from memory T-cells. Adult T-cell leukemia-lymphoma was specifically associated with signature 17, which was found to correlate with the IRF4 K59R mutation that is exclusive to Adult T-cell leukemia-lymphoma. Signature 7, implicating UV exposure was uniquely identified in cutaneous T-cell lymphoma (CTCL), contributing 52% of the mutational burden in mycosis fungoides and 23% in Sezary syndrome. Importantly this UV signature was observed in CD4 + T-cells isolated from the blood of Sezary syndrome patients suggesting extensive re-circulation of these T-cells through skin and blood. Analysis of non-Hodgkin’s T-cell lymphoma cases submitted to the national 100,000 WGS project confirmed that signature 7 was only identified in CTCL strongly implicating UV radiation in the pathogenesis of cutaneous T-cell lymphoma
Supplementary data for "The diagnostic odyssey in children and adolescents with X-linked hypophosphataemia: population-based, case-control study"
This study explored the recording of clinical features and the diagnostic odyssey of children and adolescents with X-linked hypophosphataemia in primary care electronic healthcare records in the United Kingdom.</p
A machine learning algorithm for the detection of paroxysmal nocturnal haemoglobinuria (PNH) in UK primary care electronic health records
Abstract Background Paroxysmal Nocturnal Haemoglobinuria (PNH) is an ultra-rare, acquired disorder that is challenging to diagnose due to varied symptoms, heterogeneous patient presentations, and lack of awareness among healthcare professionals. This leads to frequent misdiagnosis and delays in diagnosis. This study evaluated the feasibility of a machine learning model to identify undiagnosed PNH patients using structured electronic health records. Methods The study used data from the Optimum Patient Care Research Database, which contains electronic health records from general practitioner (GP) practices across the United Kingdom. PNH patients were identified by the presence, and control patients by the absence of a PNH diagnosis code in their records. Clinical features (symptoms, diagnoses, healthcare utilisation) from 131 patients in the PNH group and 593,838 patients in the control group, were inputted to a tree-based XGBoost machine learning model to classify patients as either “positive” or “negative” for PNH suspicion. The algorithm was finalised after additional exclusions and inclusions applied. Performance was assessed using positive predictive value (PPV), recall and specificity. As the sample used to develop the algorithm was not representative of the true population prevalence, PPV was additionally adjusted to reflect performance in the wider population. Results Of all the patients in the PNH group, 27% were classified as positive (recall). 99.99% of the control group were classified as negative (specificity). Of all the patients classified as positive, 60.4% had a diagnosis of PNH in their record (PPV). The PPV adjusted for the population prevalence of PNH was 19.59 suggesting nearly 1 in 5 patients flagged may warrant further PNH investigation. The key clinical features in the model were aplastic anaemia, pancytopenia, haemolytic anaemia, myelodysplastic syndrome, and Budd-Chiari syndrome. Conclusion This is the first study to combine clinical understanding of PNH with machine learning, demonstrating the ability to discriminate between PNH and control patients in retrospective electronic health records. With further investigation and validation, this algorithm could be deployed on live health data, potentially leading to earlier diagnosis for patients who currently experience long diagnostic delays or remain undiagnosed
Author response for "Multiple endocrine neoplasia type 1 ( <scp> <i>MEN1</i> </scp> ) 5’ <scp>UTR</scp> deletion, in <scp>MEN1</scp> family, decreases menin expression"
Whole genome sequences discriminate hereditary hemorrhagic telangiectasia phenotypes by non-HHT deleterious DNA variation
The abnormal vascular structures of hereditary hemorrhagic telangiectasia (HHT) often cause severe anemia due to recurrent hemorrhage, but HHT causal genes do not predict the severity of hematological complications. We tested for chance inheritance and clinical associations of rare deleterious variants in which loss-of-function causes bleeding or hemolytic disorders in the general population. In double-blinded analyses, all 104 patients with HHT from a single reference center recruited to the 100 000 Genomes Project were categorized on new MALO (more/as-expected/less/opposite) sub-phenotype severity scales, and whole genome sequencing data were tested for high impact variants in 75 HHT-independent genes encoding coagulation factors, or platelet, hemoglobin, erythrocyte enzyme, and erythrocyte membrane constituents. Rare variants (all gnomAD allele frequencies 15 were supported by gene-level mutation significance cutoff scores. CADD >15 variants were identified in 38/104 (36.5%) patients with HHT, found for 1 in 10 patients within platelet genes; 1 in 8 within coagulation genes; and 1 in 4 within erythrocyte hemolytic genes. In blinded analyses, patients with greater hemorrhagic severity that had been attributed solely to HHT vessels had more CADD-deleterious variants in platelet (Spearman ρ = 0.25; P = .008) and coagulation (Spearman ρ = 0.21; P = .024) genes. However, the HHT cohort had 60% fewer deleterious variants in platelet and coagulation genes than expected (Mann-Whitney test P = .021). In conclusion, patients with HHT commonly have rare variants in genes of relevance to their phenotype, offering new therapeutic targets and opportunities for informed, personalized medicine strategies
Whole genome sequences discriminate hereditary hemorrhagic telangiectasia phenotypes by non-HHT deleterious DNA variation
AbstractThe abnormal vascular structures of hereditary hemorrhagic telangiectasia (HHT) often cause severe anemia due to recurrent hemorrhage, but HHT causal genes do not predict the severity of hematological complications. We tested for chance inheritance and clinical associations of rare deleterious variants in which loss-of-function causes bleeding or hemolytic disorders in the general population. In double-blinded analyses, all 104 patients with HHT from a single reference center recruited to the 100 000 Genomes Project were categorized on new MALO (more/as-expected/less/opposite) sub-phenotype severity scales, and whole genome sequencing data were tested for high impact variants in 75 HHT-independent genes encoding coagulation factors, or platelet, hemoglobin, erythrocyte enzyme, and erythrocyte membrane constituents. Rare variants (all gnomAD allele frequencies &lt;0.003) were identified in 56 (75%) of these 75 HHT-unrelated genes. Deleteriousness assignments by Combined Annotation Dependent Depletion (CADD) scores &gt;15 were supported by gene-level mutation significance cutoff scores. CADD &gt;15 variants were identified in 38/104 (36.5%) patients with HHT, found for 1 in 10 patients within platelet genes; 1 in 8 within coagulation genes; and 1 in 4 within erythrocyte hemolytic genes. In blinded analyses, patients with greater hemorrhagic severity that had been attributed solely to HHT vessels had more CADD-deleterious variants in platelet (Spearman ρ = 0.25; P = .008) and coagulation (Spearman ρ = 0.21; P = .024) genes. However, the HHT cohort had 60% fewer deleterious variants in platelet and coagulation genes than expected (Mann-Whitney test P = .021). In conclusion, patients with HHT commonly have rare variants in genes of relevance to their phenotype, offering new therapeutic targets and opportunities for informed, personalized medicine strategies.</jats:p
