Search CORE

6 research outputs found

Recommended from our members

Predicting seizure recurrence after an initial seizure-like episode from routine clinical notes using large language models: A retrospective cohort study

Author: Ali Waqar
Alsentzer Emily
Bartmann Ana Paula
Beaulieu-Jones Brett K.
de Jong Johann
Kohane Isaac
Patra Arijit
Scordis Phil
Villamar Mauricio F.
Wissel Benjamin D.
Publication venue
Publication date: 26/11/2023
Field of study

Background: The evaluation and management of first-time seizure-like events in children can be difficult because these episodes are not always directly observed and might be epileptic seizures or other conditions (seizure mimics). We aimed to evaluate whether machine learning models using real-world data could predict seizure recurrence after an initial seizure-like event. Methods: This retrospective cohort study compared models trained and evaluated on two separate datasets between Jan 1, 2010, and Jan 1, 2020: electronic medical records (EMRs) at Boston Children's Hospital and de-identified, patient-level, administrative claims data from the IBM MarketScan research database. The study population comprised patients with an initial diagnosis of either epilepsy or convulsions before the age of 21 years, based on International Classification of Diseases, Clinical Modification (ICD-CM) codes. We compared machine learning-based predictive modelling using structured data (logistic regression and XGBoost) with emerging techniques in natural language processing by use of large language models. Findings: The primary cohort comprised 14 021 patients at Boston Children's Hospital matching inclusion criteria with an initial seizure-like event and the comparison cohort comprised 15 062 patients within the IBM MarketScan research database. Seizure recurrence based on a composite expert-derived definition occurred in 57% of patients at Boston Children's Hospital and 63% of patients within IBM MarketScan. Large language models with additional domain-specific and location-specific pre-training on patients excluded from the study (F1-score 0·826 [95% CI 0·817-0·835], AUC 0·897 [95% CI 0·875-0·913]) performed best. All large language models, including the base model without additional pre-training (F1-score 0·739 [95% CI 0·738-0·741], AUROC 0·846 [95% CI 0·826-0·861]) outperformed models trained with structured data. With structured data only, XGBoost outperformed logistic regression and XGBoost models trained with the Boston Children's Hospital EMR (logistic regression: F1-score 0·650 [95% CI 0·643-0·657], AUC 0·694 [95% CI 0·685-0·705], XGBoost: F1-score 0·679 [0·676-0·683], AUC 0·725 [0·717-0·734]) performed similarly to models trained on the IBM MarketScan database (logistic regression: F1-score 0·596 [0·590-0·601], AUC 0·670 [0·664-0·675], XGBoost: F1-score 0·678 [0·668-0·687], AUC 0·710 [0·703-0·714]). Interpretation: Physician's clinical notes about an initial seizure-like event include substantial signals for prediction of seizure recurrence, and additional domain-specific and location-specific pre-training can significantly improve the performance of clinical large language models, even for specialised cohorts.</p

Knowledge UChicago

Data and sample sharing as an enabler for large-scale biomarker research and development: The EPND perspective

Author: Anthony J Brookes (8198061)
Consortium EPND (14423751)
Niranjan Bose (219032)
Phil Scordis (3415319)
Pieter Jelle Visser (7545980)
Publication venue
Publication date: 30/11/2022
Field of study

Biomarker discovery, development, and validation are reliant on large-scale analyses of high-quality samples and data. Currently, significant quantities of data and samples have been generated by European studies on Alzheimer's disease (AD) and other neurodegenerative diseases (NDD), representing a valuable resource for developing biomarkers to support early detection of disease, treatment monitoring, and patient stratification. However, discovery of, access to, and sharing of data and samples from AD and NDD research are hindered both by silos that limit collaboration, and by the array of complex requirements for secure, legal, and ethical sharing. In this Perspective article, we examine key challenges currently hampering large-scale biomarker research, and outline how the European Platform for Neurodegenerative Diseases (EPND) plans to address them. The first such challenge is a fragmented landscape filled with technical barriers that make it difficult to discover and access high-quality samples and data in one location. A second challenge is related to the complex array of legal and ethical requirements that must be navigated by researchers when sharing data and samples, to ensure compliance with data protection regulations and research ethics. Another challenge is the lack of broad-scale collaboration and opportunities to facilitate partnerships between data and sample contributors and researchers, in addition to a lack of regulatory engagement early in the research process to enable validation of potential biomarkers. A further challenge facing projects is the need to remain sustainable beyond initial funding periods, ensuring data and samples are shared and reused, thereby driving further research and innovation. In addressing these challenges, EPND will enable an environment of faster and more disruptive research on diagnostics and disease-modifying therapies for Alzheimer's disease and other neurodegenerative diseases

Leicester Research Archive

Data and sample sharing as an enabler for large-scale biomarker research and development: The EPND perspective

Author: Anthony J Brookes (8198061)
Consortium EPND (14423751)
Niranjan Bose (219032)
Phil Scordis (3415319)
Pieter Jelle Visser (7545980)
Publication venue: 'Frontiers Media SA'
Publication date: 30/11/2022
Field of study

Maastricht University Research Portal

PubMed Central

Leicester Research Archive

Additional file 1: of PDON: Parkinson’s disease ontology for representation and modeling of the Parkinson’s disease knowledge domain

Author: Alpha Kodamullil (3415316)
Ashutosh Malhotra (437355)
Bernd Müller (340116)
Dieter Scheller (675472)
Erfan Younesi (437356)
Martin Hofmann-Apitius (89705)
Matt Page (3415325)
Michaela Gündel (690900)
Phil Scordis (3415319)
Stephan Springstubbe (3415322)
Ullrich Wüllner (166228)
Publication venue
Publication date
Field of study

Convergence of miRNA Expression Profiling, ?-Synuclein Interacton and GWAS in Parkinson's Disease. (DOCX 449 KB

FigShare

The BIOMarkers in Atopic Dermatitis and Psoriasis (BIOMAP) Glossary: developing a lingua franca to facilitate data harmonisation and cross-cohort analyses

Author: Apfelbacher Christian
Bosma Angela
Broderick Conor
Christian Nils
Dand Nick
Flohr Carsten
Ghosh Soumyabrata
Hangel Nora
Hübenthal Matthias
Middelkamp-Hup M.A
Min Josine
Musters Annelie H
Paternoster Lavinia
Rodríguez Elke
Satagopam Venkata
Scordis Phil
Smith Catherine
Spuls Phyllis Ira
Szymczak Silke
Weidinger Stephan
Publication venue
Publication date: 16/06/2021
Field of study

Dear Editor, BIOMAP (BIOMarkers in Atopic dermatitis and Psoriasis) is a large European consortium aiming to advance personalised medicine for atopic dermatitis and psoriasis by identifying biomarkers which predict therapeutic response and disease progression. BIOMAP brings together clinicians, researchers, patient organisations and pharmaceutical industry partners and encompasses data from over 60 individual studies, including randomised clinical trials, population-based cohorts and deeply-phenotyped disease registries. The curation and harmonisation of data and bio-samples from these established studies will facilitate cross-cohort clinical and molecular analyses, increasing the potential to identify small effect estimates and to better stratify disease subtypes. This letter serves to disseminate BIOMAP's pathway to data harmonisation and will inform future collaborative research endeavours

Open Repository and Bibliography - Luxembourg

PDON: Parkinson’s disease ontology for representation and modeling of the Parkinson’s disease knowledge domain

Author: A Doms
A Malhotra
A Malhotra
Alpha Tom Kodamullil
Ashutosh Malhotra
Bernd Müller
CM Friedrich
D Ferrucci
D Hanisch
D Pal
Dieter Scheller
DL Rubin
Erfan Younesi
G Héja
I Kola
I Spasic
K Eilbeck
KA Fujita
LM Lau de
M Ashburner
M Gündel
M Martins
Martin Hofmann-Apitius
Matt Page
Michaela Gündel
NH Shah
PD Thomas
Phil Scordis
PL Whetzel
R Stevens
S Killcoyne
S Yoshikawa
Stephan Springstubbe
SW Scholz
TR Gruber
Ullrich Wüllner
Y Ma
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref