102 research outputs found
Enhancing Network Initialization for Medical AI Models Using Large-Scale, Unlabeled Natural Images
Pre-training datasets, like ImageNet, have become the gold standard in
medical image analysis. However, the emergence of self-supervised learning
(SSL), which leverages unlabeled data to learn robust features, presents an
opportunity to bypass the intensive labeling process. In this study, we
explored if SSL for pre-training on non-medical images can be applied to chest
radiographs and how it compares to supervised pre-training on non-medical
images and on medical images. We utilized a vision transformer and initialized
its weights based on (i) SSL pre-training on natural images (DINOv2), (ii) SL
pre-training on natural images (ImageNet dataset), and (iii) SL pre-training on
chest radiographs from the MIMIC-CXR database. We tested our approach on over
800,000 chest radiographs from six large global datasets, diagnosing more than
20 different imaging findings. Our SSL pre-training on curated images not only
outperformed ImageNet-based pre-training (P<0.001 for all datasets) but, in
certain cases, also exceeded SL on the MIMIC-CXR dataset. Our findings suggest
that selecting the right pre-training strategy, especially with SSL, can be
pivotal for improving artificial intelligence (AI)'s diagnostic accuracy in
medical imaging. By demonstrating the promise of SSL in chest radiograph
analysis, we underline a transformative shift towards more efficient and
accurate AI models in medical imaging
Empowering Clinicians and Democratizing Data Science: Large Language Models Automate Machine Learning for Clinical Studies
A knowledge gap persists between Machine Learning (ML) developers (e.g., data
scientists) and practitioners (e.g., clinicians), hampering the full
utilization of ML for clinical data analysis. We investigated the potential of
the chatGPT Advanced Data Analysis (ADA), an extension of GPT-4, to bridge this
gap and perform ML analyses efficiently. Real-world clinical datasets and study
details from large trials across various medical specialties were presented to
chatGPT ADA without specific guidance. ChatGPT ADA autonomously developed
state-of-the-art ML models based on the original study's training data to
predict clinical outcomes such as cancer development, cancer progression,
disease complications, or biomarkers such as pathogenic gene sequences.
Strikingly, these ML models matched or outperformed their published
counterparts. We conclude that chatGPT ADA offers a promising avenue to
democratize ML in medicine, making advanced analytics accessible to non-ML
experts and promoting broader applications in medical research and practice
Hydrogen production by chemical-looping reforming in a circulating fluidized bed reactor using Ni-based oxygen carriers
7 pages, 11 figures,.- Available online November 18, 2008.This work presents the experimental results obtained during auto-thermal chemical-looping reforming (CLR) in a 900 Wth circulating fluidized bed reactor under continuous operation using methane as fuel. Two oxygen carriers based on NiO and supported on γ-Al2O3 and α-Al2O3 were used during more than 50 h of operation with each oxygen carrier. During operation the effect of different operating variables, like fuel reactor temperature, H2O/CH4 molar ratio and solid circulation rate, on CH4 conversion and gas product distribution was analyzed. It was found that in all operating conditions CH4 conversion was very high (>98%) and the most important variable affecting to the gas product distribution was the solid circulation rate, that is, NiO/CH4 molar ratio. Similar gas product distribution was obtained working with both oxygen carriers although at different NiO/CH4 molar ratios. The oxygen carrier of NiO on α-Al2O3 needed lower NiO/CH4 molar ratio to reach the same gas product composition than the oxygen carrier of NiO on γ-Al2O3. Working at optimal operating conditions, 2.5 moles of H2 per mol of CH4 could be obtained in this process. During operation the oxygen carrier particles maintained their physical and chemical properties. These results suggest that these oxygen carriers could have a high durability, being suitable oxygen carriers for a CLR system. © 2008 Elsevier B.V. All rights reserved.This work was partially supported by the European Commission, under the 6th Framework Programme (CACHET Project, Contract no. 019972), and from the CCP2 (CO2 Capture Project), a partnership of BP, Chevron, Conoco-Phillips, Eni Technology, Norsk Hydro, Shell, Suncor, and Petrobras. M. Ortiz thanks Diputación General de Aragon for the F.P.I. fellowship.Peer Reviewe
Private, fair and accurate: Training large-scale, privacy-preserving AI models in medical imaging
Artificial intelligence (AI) models are increasingly used in the medical
domain. However, as medical data is highly sensitive, special precautions to
ensure its protection are required. The gold standard for privacy preservation
is the introduction of differential privacy (DP) to model training. Prior work
indicates that DP has negative implications on model accuracy and fairness,
which are unacceptable in medicine and represent a main barrier to the
widespread use of privacy-preserving techniques. In this work, we evaluated the
effect of privacy-preserving training of AI models regarding accuracy and
fairness compared to non-private training. For this, we used two datasets: (1)
A large dataset (N=193,311) of high quality clinical chest radiographs, and (2)
a dataset (N=1,625) of 3D abdominal computed tomography (CT) images, with the
task of classifying the presence of pancreatic ductal adenocarcinoma (PDAC).
Both were retrospectively collected and manually labeled by experienced
radiologists. We then compared non-private deep convolutional neural networks
(CNNs) and privacy-preserving (DP) models with respect to privacy-utility
trade-offs measured as area under the receiver-operator-characteristic curve
(AUROC), and privacy-fairness trade-offs, measured as Pearson's r or
Statistical Parity Difference. We found that, while the privacy-preserving
trainings yielded lower accuracy, they did largely not amplify discrimination
against age, sex or co-morbidity. Our study shows that -- under the challenging
realistic circumstances of a real-life clinical dataset -- the
privacy-preserving training of diagnostic deep learning models is possible with
excellent diagnostic accuracy and fairness.Comment: Published in Communications Medicine. Nature Portfoli
Preserving privacy in domain transfer of medical AI models comes at no performance costs: The integral role of differential privacy
Developing robust and effective artificial intelligence (AI) models in
medicine requires access to large amounts of patient data. The use of AI models
solely trained on large multi-institutional datasets can help with this, yet
the imperative to ensure data privacy remains, particularly as membership
inference risks breaching patient confidentiality. As a proposed remedy, we
advocate for the integration of differential privacy (DP). We specifically
investigate the performance of models trained with DP as compared to models
trained without DP on data from institutions that the model had not seen during
its training (i.e., external validation) - the situation that is reflective of
the clinical use of AI models. By leveraging more than 590,000 chest
radiographs from five institutions, we evaluated the efficacy of DP-enhanced
domain transfer (DP-DT) in diagnosing cardiomegaly, pleural effusion,
pneumonia, atelectasis, and in identifying healthy subjects. We juxtaposed
DP-DT with non-DP-DT and examined diagnostic accuracy and demographic fairness
using the area under the receiver operating characteristic curve (AUC) as the
main metric, as well as accuracy, sensitivity, and specificity. Our results
show that DP-DT, even with exceptionally high privacy levels (epsilon around
1), performs comparably to non-DP-DT (P>0.119 across all domains). Furthermore,
DP-DT led to marginal AUC differences - less than 1% - for nearly all
subgroups, relative to non-DP-DT. Despite consistent evidence suggesting that
DP models induce significant performance degradation for on-domain
applications, we show that off-domain performance is almost not affected.
Therefore, we ardently advocate for the adoption of DP in training diagnostic
medical AI models, given its minimal impact on performance.Comment: Published in Radiology: Artificial Intelligence. RSN
Collaborative Training of Medical Artificial Intelligence Models with non-uniform Labels
Artificial intelligence (AI) methods are revolutionizing medical image
analysis. However, robust AI models require large multi-site datasets for
training. While multiple stakeholders have provided publicly available
datasets, the ways in which these data are labeled differ widely. For example,
one dataset of chest radiographs might contain labels denoting the presence of
metastases in the lung, while another dataset of chest radiograph might focus
on the presence of pneumonia. With conventional approaches, these data cannot
be used together to train a single AI model. We propose a new framework that we
call flexible federated learning (FFL) for collaborative training on such data.
Using publicly available data of 695,000 chest radiographs from five
institutions - each with differing labels - we demonstrate that large and
heterogeneously labeled datasets can be used to train one big AI model with
this framework. We find that models trained with FFL are superior to models
that are trained on matching annotations only. This may pave the way for
training of truly large-scale AI models that make efficient use of all existing
data.Comment: 2 figures, 3 tables, 5 supplementary table
Time-efficient combined morphologic and quantitative joint MRI based on clinical image contrasts -- An exploratory in-situ study of standardized cartilage defects
OBJECTIVES: Quantitative MRI techniques such as T2 and T1 mapping are
beneficial in evaluating cartilage and meniscus. We aimed to evaluate the
MIXTURE (Multi-Interleaved X-prepared Turbo-Spin Echo with IntUitive
RElaxometry) sequences that provide morphologic images with clinical turbo
spin-echo (TSE) contrasts and additional parameter maps versus reference TSE
sequences in an in-situ model of human cartilage defects.
MATERIALS AND METHODS: Prospectively, standardized cartilage defects of 8mm,
5mm, and 3mm diameter were created in the lateral femora of 10 human cadaveric
knee specimens (8110 years, nine male/one female). Using a clinical 3T MRI
scanner and knee coil, MIXTURE sequences combining (i) proton-density weighted
fat-saturated (PD-w FS) images and T2 maps and (ii) T1-weighted images and
T1 maps were acquired before and after defect creation, alongside the
corresponding 2D TSE and 3D TSE reference sequences. Defect delineability, bone
texture, and cartilage relaxation times were quantified. Inter-sequence
comparisons were made using appropriate parametric and non-parametric tests.
RESULTS: Overall, defect delineability and texture features were not
significantly different between the MIXTURE and reference sequences. After
defect creation, relaxation times increased significantly in the central femur
(for T2) and all regions combined (for T1).
CONCLUSION: MIXTURE sequences permit time-efficient simultaneous morphologic
and quantitative joint assessment based on clinical image contrasts. While
providing T2 or T1 maps in clinically feasible scan time, morphologic
image features, i.e., cartilage defect delineability and bone texture, were
comparable between MIXTURE and corresponding reference sequences.Comment: 12 pages (main body), 3 tables, 6 figure
Two for One -- Combined Morphologic and Quantitative Knee Joint MRI Using a Versatile Turbo Spin-Echo Platform
Introduction: Quantitative MRI techniques such as T2 and T1\r{ho} mapping are
beneficial in evaluating knee joint pathologies; however, long acquisition
times limit their clinical adoption. MIXTURE (Multi-Interleaved X-prepared
Turbo-Spin Echo with IntUitive RElaxometry) provides a versatile turbo
spin-echo (TSE) sequence platform for simultaneous morphologic and quantitative
joint imaging yet lacks comparative evaluation in basic and translational
research contexts.
Methods: Two MIXTURE sequences were designed along clinical requirements: (i)
MIX1, combining proton density (PD)-weighted fat-saturated (FS) images and
quantitative T2 mapping (acquisition time: 4:59 min), and (ii) MIX2, combining
T1-weighted images with quantitative T1\r{ho} mapping (6:38 min). MIXTURE
sequences and their reference 2D and 3D TSE counterparts were acquired from ten
human cadaveric knee joints using a clinical 3T MRI scanner and knee coil.
Contrast, contrast-to-noise ratios, and coefficients of variation were
comparatively evaluated using parametric tests. Clinical radiologists (n=3)
assessed diagnostic quality as a function of sequence and anatomic structure
using 5-point Likert scales and ordinal regression. The significance level was
set to {\alpha}=0.01.
Results: MIX1 and MIX2 had at least equal diagnostic quality compared to the
2D and 3D TSE sequences of the same image weighting. Contrast,
contrast-to-noise ratios, and coefficients of variation were largely similar
for the PD-weighted FS and T1-weighted images.
Discussion: In clinically feasible scan times, the MIXTURE sequence platform
yields (i) morphologic images of diagnostic quality and adjustable TSE-based
contrasts and (ii) quantitative parameter mapping with additional insights on
soft tissue composition and ultrastructure.Comment: 13 pages (main text), 7 figures, 3 table
- …