Search CORE

20 research outputs found

<sup>18</sup>F-FDG PET baseline radiomics features improve the prediction of treatment outcome in diffuse large B-cell lymphoma

Author: Boellaard Ronald
de Vet Henrica C. W.
Eertink Jakoba J.
Hoekstra Otto S.
Lugtenburg Pieternella J.
Pfaehler Elisabeth A. G.
van de Brug Tim
van der Holt Bronno
Wiegers Sanne E.
Zijlstra Josee M.
Zwezerijnen Gerben J. C.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2022
Field of study

PURPOSE: Accurate prognostic markers are urgently needed to identify diffuse large B-Cell lymphoma (DLBCL) patients at high risk of progression or relapse. Our purpose was to investigate the potential added value of baseline radiomics features to the international prognostic index (IPI) in predicting outcome after first-line treatment. METHODS: Three hundred seventeen newly diagnosed DLBCL patients were included. Lesions were delineated using a semi-automated segmentation method (standardized uptake value ≥ 4.0), and 490 radiomics features were extracted. We used logistic regression with backward feature selection to predict 2-year time to progression (TTP). The area under the curve (AUC) of the receiver operator characteristic curve was calculated to assess model performance. High-risk groups were defined based on prevalence of events; diagnostic performance was assessed using positive and negative predictive values. RESULTS: The IPI model yielded an AUC of 0.68. The optimal radiomics model comprised the natural logarithms of metabolic tumor volume (MTV) and of SUV(peak) and the maximal distance between the largest lesion and any other lesion (Dmax(bulk), AUC 0.76). Combining radiomics and clinical features showed that a combination of tumor- (MTV, SUV(peak) and Dmax(bulk)) and patient-related parameters (WHO performance status and age > 60 years) performed best (AUC 0.79). Adding radiomics features to clinical predictors increased PPV with 15%, with more accurate selection of high-risk patients compared to the IPI model (progression at 2-year TTP, 44% vs 28%, respectively). CONCLUSION: Prediction models using baseline radiomics combined with currently used clinical predictors identify patients at risk of relapse at baseline and significantly improved model performance. TRIAL REGISTRATION NUMBER AND DATE: EudraCT: 2006–005,174-42, 01–08-2008. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1007/s00259-021-05480-3

Proceedings - University of Groningen

University of Groningen

ARTS repository - University of Groningen

PubMed Central

EUR Research Repository

Dissertations of the University of Groningen

Pregnancy outcomes in women with Budd-Chiari syndrome or portal vein thrombosis A multicentre retrospective cohort study

Author: Damhuis Stefanie E
Darwish Murad Sarwa
Duvekot Johannes J
Ganzevoort Wessel
Gordijn Sanne J
Hamulyák Eva N
Hutten Barbara A
Lauw Mandy N
Leentjens Jenneke
Middeldorp Saskia
Scheres Luuk J J
van Duuren Jack R
Wiegers Hanke M G
Publication venue: 'Wiley'
Publication date: 01/01/2021
Field of study

OBJECTIVE: To evaluate current practice and outcomes of pregnancy in women previously diagnosed with Budd-Chiari syndrome and/or portal vein thrombosis, with and without concomitant portal hypertension. DESIGN AND SETTING: Multicentre retrospective cohort study between 2008-2021. POPULATION: Women who conceived in the predefined period after the diagnosis of Budd-Chiari syndrome and/or portal vein thrombosis. METHODS AND MAIN OUTCOME MEASURES: We collected data on diagnosis and clinical features. The primary outcomes were maternal mortality and live birth rate. Secondary outcomes included maternal, neonatal and obstetric complications. RESULTS: Forty-five women (12 Budd-Chiari syndrome, 33 portal vein thrombosis; 76 pregnancies) were included. Underlying prothrombotic disorders were present in 23 of 45 women (51%). Thirty-eight women (84%) received low-molecular-weight heparin during pregnancy. Of 45 first pregnancies, 11 (24%) ended in pregnancy loss and 34 (76%) resulted in live birth of which 27 at term age (79% of live births and 60% of pregnancies). No maternal deaths were observed, one woman developed pulmonary embolism during pregnancy and two women (4%) had variceal bleeding requiring intervention. CONCLUSIONS: The high number of term live births (79%) and lower than expected risk of pregnancy-related maternal and neonatal morbidity in our cohort suggest that Budd-Chiari syndrome and/or portal vein thrombosis should not be considered as an absolute contra-indication for pregnancy. Individualized, nuanced counselling and a multidisciplinary pregnancy surveillance approach are essential in this patient population

Proceedings - University of Groningen

University of Groningen

ARTS repository - University of Groningen

EUR Research Repository

Dissertations of the University of Groningen

Baseline radiomics features and MYC rearrangement status predict progression in aggressive B-cell lymphoma

Author: Boellaard Ronald
Chamuleau Martine E.D.
de Jong Daphne
de Vet Henrica C.W.
Dührsen Ulrich
Eertink Jakoba Johanna
Eertink Jakoba Johanna
Hanoun Christine
Hoekstra Otto S.
Hüttmann Andreas
Jauw Yvonne W.S.
Klapper Wolfram
Lugtenburg Pieternella J.
Mendeville Matias
Pieplenbosch Simone
Richter Julia
Wiegers Sanne E.
Ylstra Bauke
Zijlstra Josée M.
Zwezerijnen Gerben J.C.
Publication venue: 'American Society of Hematology'
Publication date: 02/11/2022
Field of study

We investigated whether the outcome prediction of patients with aggressive B-cell lymphoma can be improved by combining clinical, molecular genotype, and radiomics features. MYC, BCL2, and BCL6 rearrangements were assessed using fluorescence in situ hybridization. Seventeen radiomics features were extracted from the baseline positron emission tomography–computed tomography of 323 patients, which included maximum standardized uptake value (SUV(max)), SUV(peak), SUV(mean), metabolic tumor volume (MTV), total lesion glycolysis, and 12 dissemination features pertaining to distance, differences in uptake and volume between lesions, respectively. Logistic regression with backward feature selection was used to predict progression after 2 years. The predictive value of (1) International Prognostic Index (IPI); (2) IPI plus MYC; (3) IPI, MYC, and MTV; (4) radiomics; and (5) MYC plus radiomics models were tested using the cross-validated area under the curve (CV-AUC) and positive predictive values (PPVs). IPI yielded a CV-AUC of 0.65 ± 0.07 with a PPV of 29.6%. The IPI plus MYC model yielded a CV-AUC of 0.68 ± 0.08. IPI, MYC, and MTV yielded a CV-AUC of 0.74 ± 0.08. The highest model performance of the radiomics model was observed for MTV combined with the maximum distance between the largest lesion and another lesion, the maximum difference in SUV(peak) between 2 lesions, and the sum of distances between all lesions, yielding an improved CV-AUC of 0.77 ± 0.07. The same radiomics features were retained when adding MYC (CV-AUC, 0.77 ± 0.07). PPV was highest for the MYC plus radiomics model (50.0%) and increased by 20% compared with the IPI (29.6%). Adding radiomics features improved model performance and PPV and can, therefore, aid in identifying poor prognosis patients

EUR Research Repository

PubMed Central

Combatting the effect of image reconstruction settings on lymphoma [18F]FDG PET metabolic tumor volume assessment using various segmentation methods

Author: Boellaard Ronald
Eertink Jakoba J.
Ferrandez Maria C.
Golla Sandeep S., V
Pieplenbosch Simone
Wiegers Sanne E.
Zijlstra Josee M.
Zwezerijnen Gerben J. C.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 29/07/2022
Field of study

Background: [18F]FDG PET-based metabolic tumor volume (MTV) is a promising prognostic marker for lymphoma patients. The aim of this study is to assess the sensitivity of several MTV segmentation methods to variations in image reconstruction methods and the ability of ComBat to improve MTV reproducibility. Methods: Fifty-six lesions were segmented from baseline [18F]FDG PET scans of 19 lymphoma patients. For each scan, EARL1 and EARL2 standards and locally clinically preferred reconstruction protocols were applied. Lesions were delineated using 9 semiautomatic segmentation methods: fixed threshold based on standardized uptake value (SUV), (SUV = 4, SUV = 2.5), relative threshold (41% of SUVmax [41M], 50% of SUVpeak [A50P]), majority vote-based methods that select voxels detected by at least 2 (MV2) and 3 (MV3) out of the latter 4 methods, Nestle thresholding, and methods that identify the optimal method based on SUVmax (L2A, L2B). MTVs from EARL2 and locally clinically preferred reconstructions were compared to those from EARL1. Finally, different versions of ComBat were explored to harmonize the data. Results: MTVs from the SUV4.0 method were least sensitive to the use of different reconstructions (MTV ratio: median = 1.01, interquartile range = [0.96–1.10]). After ComBat harmonization, an improved agreement of MTVs among different reconstructions was found for most segmentation methods. The regular implementation of ComBat (‘Regular ComBat’) using non-transformed distributions resulted in less accurate and precise MTV alignments than a version using log-transformed datasets (‘Log-transformed ComBat’). Conclusion: MTV depends on both segmentation method and reconstruction methods. ComBat reduces reconstruction dependent MTV variability, especially when log-transformation is used to account for the non-normal distribution of MTVs

PubMed Central

Quantitative Radiomics Features in Diffuse Large B-Cell Lymphoma: Does Segmentation Method Matter?

Author: Boellaard Ronald
de Vet Henrica C.W.
Eertink Jakoba J.
Hoekstra Otto S.
Lugtenburg Pieternella J.
Pfaehler Elisabeth A.G.
van de Brug Tim
Wiegers Sanne E.
Zijlstra Josée M.
Publication venue: 'Society of Nuclear Medicine'
Publication date: 01/03/2022
Field of study

Radiomics features may predict outcome in diffuse large B-cell lymphoma (DLBCL). Currently, multiple segmentation methods are used to calculate metabolic tumor volume (MTV). We assessed the influence of segmentation method on the discriminative power of radiomics features in DLBCL at the patient level and for the largest lesion. Methods: Fifty baseline 18F-FDG PET/CT scans of DLBCL patients with progression or relapse within 2 years after diagnosis were matched on uptake time and reconstruction method with 50 baseline PET/CT scans of DLBCL patients without progression. Scans were analyzed using 6 semiautomatic segmentation methods (SUV threshold of 4.0 [SUV4.0], SUV threshold of 2.5, 41% of SUVmax, 50% of SUVpeak, a majority vote segmenting voxels detected by

2methods, and a majority vote segmenting voxels detected by

3 methods). On the basis of these segmentations, 490 radiomics features were extracted at the patient level, and 486 features were extracted for the largest lesion. To quantify the agreement between features extracted from different segmentation methods, the intraclass correlation (ICC) agreement was calculated for each method compared with SUV4.0. The feature space was reduced by deleting features that had high Pearson correlations ($0.7) with the previously established predictors MTV or SUVpeak. Model performance was assessed using stratified repeated cross validation with 5 folds and 2,000 repeats, yielding the mean receiver-operating-characteristics curve integral for all segmentation methods using logistic regression with backward feature selection. Results: The percentage of features yielding an ICC of at least 0.75, compared with the SUV4.0 segmentation, was lowest for 50% of SUVpeak both at the patient level and for the largest lesion, with 77.3% and 66.7% of the features yielding an ICC of at least 0.75, respectively. Features did not correlate strongly with MTV, with at least 435 features at the patient level and 409 features for the largest lesion for all segmentation methods having a correlation coefficient of less than 0.7. Features correlated strongly with SUVpeak (at least 190 at patient level and 134 for the largest lesion were uncorrelated to SUVpeak, respectively). Receiver-operatingcharacteristics curve integrals ranged between 0.6960.11 and 0.8460.09 at the patient level and between 0.6960.11 and 0.7360.10 at the lesion level. Conclusion: Even though there are differences in the actual radiomics feature values derived and selected features among segmentation methods, there is no substantial difference in the discriminative power of radiomics features among segmentation methods

Quantitative radiomics features in diffuse large B-cell lymphoma: does segmentation method matter?

Author: Boellaard Ronald
de Vet Henrica C. W.
Eertink Jakoba J.
Hoekstra Otto S
Lugtenburg Pieternella J.
Pfaehler Elisabeth A. G.
van de Brug Tim
Wiegers Sanne E.
Zijlstra Josee M.
Publication venue: 'Society of Nuclear Medicine'
Publication date: 01/01/2021
Field of study

2methods, and a majority vote segmenting voxels detected by

OPUS Augsburg

EUR Research Repository

Interobserver agreement in automated metabolic tumor volume measurements of Deauville score 4 and 5 lesions at interim 18F-FDG PET in DLBCL

Author: Boellaard Ronald
Burggraaff Coreline N.
de Vet Henrica C.W.
Eertink Jakoba J.
Hoekstra Otto S.
Lugtenburg Pieternella J.
Oprea-Lager Daniela E.
Pieplenbosch Simone
Shaban Ekhlas A.I.N.
Wiegers Sanne E.
Zijlstra Josee M.
Zwezerijnen Gerben J.C.
Publication venue: 'Society of Nuclear Medicine'
Publication date: 01/11/2021
Field of study

Metabolic tumor volume (MTV) on interim-PET (I-PET) is a potential prognostic biomarker for diffuse large B-cell lymphoma (DLBCL). Implementation of MTV on I-PET requires consensus which semi-automated segmentation method delineates lesions most successfully with least user interaction. Methods used for baseline PET are not necessarily optimal for I-PET due to lower lesional standardized uptake values (SUV) at I-PET. Therefore, we aimed to evaluate which method provides the best delineation quality of Deauville-score (DS) 4-5 DLBCL lesions on I-PET at best interobserver agreement on delineation quality and, secondly, to assess the effect of lesional SUVmax on delineation quality and performance agreements. Methods: DS4-5 lesions from 45 I-PET scans were delineated using six semi-automated methods i) SUV 2.5, ii) SUV 4.0, iii) adaptive threshold [A50%peak], iv) 41% of maximum SUV [41%max], v) majority vote including voxels detected by ≥2 methods [MV2] and vi) detected by ≥3 methods [MV3]. Delineation quality per MTV was rated by three independent observers as acceptable or non-acceptable. For each method, observer scores on delineation quality, specific agreements and MTV were assessed for all lesions, and per category of lesional SUVmax (10). Results: In 60 DS4-5 lesions on I-PET, MV3 performed best, with acceptable delineation in 90% of lesions, with a positive agreement (PA) of 93%. Delineation quality scores and agreements per method strongly depended on lesional SUV: the best delineation quality scores were obtained using MV3 in lesions with SUVmax10, were comparable after excluding visually failed MV3 contouring. For lesions with SUVmax<10, MTVs using different methods correlated poorly. Conclusion: On I-PET, MV3 performed best and provided the highest interobserver agreement regarding acceptable delineations of DS4-5 DLBCL lesions. However, delineation method preference strongly depended on lesional SUV. Therefore, we suggest to explore an approach that identifies the optimal delineation method per lesion as function of tumor FDG uptake characteristics, i.e. SUVmax

Interobserver agreement in automated metabolic tumor volume measurements of Deauville score 4 and 5 lesions at interim 18F-FDG PET in DLBCL

Author: Boellaard Ronald
Burggraaff Coreline N
de Vet Henrica Cw
Eertink Jakoba J
Hoekstra Otto S
Lugtenburg Pieternella J
Oprea-Lager Daniela E
Pieplenbosch Simone
Shaban Ekhlas A
Wiegers Sanne E
Zijlstra Josee M
Zwezerijnen Gerben Jc
Publication venue: 'Society of Nuclear Medicine'
Publication date: 05/03/2021
Field of study

Introduction: Metabolic tumor volume (MTV) on interim-PET (I-PET) is a potential prognostic biomarker for diffuse large B-cell lymphoma (DLBCL). Implementation of MTV on I-PET requires consensus which semi-automated segmentation method delineates lesions most successfully with least user interaction. Methods used for baseline PET are not necessarily optimal for I-PET due to lower lesional standardized uptake values (SUV) at I-PET. Therefore, we aimed to evaluate which method provides the best delineation quality of Deauville-score (DS) 4-5 DLBCL lesions on I-PET at best interobserver agreement on delineation quality and, secondly, to assess the effect of lesional SUVmax on delineation quality and performance agreements. Methods: DS4-5 lesions from 45 I-PET scans were delineated using six semi-automated methods i) SUV 2.5, ii) SUV 4.0, iii) adaptive threshold [A50%peak], iv) 41% of maximum SUV [41%max], v) majority vote including voxels detected by ≥2 methods [MV2] and vi) detected by ≥3 methods [MV3]. Delineation quality per MTV was rated by three independent observers as acceptable or non-acceptable. For each method, observer scores on delineation quality, specific agreements and MTV were assessed for all lesions, and per category of lesional SUVmax (10). Results: In 60 DS4-5 lesions on I-PET, MV3 performed best, with acceptable delineation in 90% of lesions, with a positive agreement (PA) of 93%. Delineation quality scores and agreements per method strongly depended on lesional SUV: the best delineation quality scores were obtained using MV3 in lesions with SUVmax10, were comparable after excluding visually failed MV3 contouring. For lesions with SUVmax<10, MTVs using different methods correlated poorly. Conclusion: On I-PET, MV3 performed best and provided the highest interobserver agreement regarding acceptable delineations of DS4-5 DLBCL lesions. However, delineation method preference strongly depended on lesional SUV. Therefore, we suggest to explore an approach that identifies the optimal delineation method per lesion as function of tumor FDG uptake characteristics, i.e. SUVmax

PubMed Central

EUR Research Repository

Sensitivity of an AI method for [18F]FDG PET/CT outcome prediction of diffuse large B-cell lymphoma patients to image reconstruction protocols

Author: Bart M. de Vries
Gerben J. C. Zwezerijnen
Jakoba J. Eertink
Josée M. Zijlstra
Louise Schilder
Maria C. Ferrández
Martijn W. Heymans
Ronald Boellaard
Sandeep S. V. Golla
Sanne E. Wiegers
Simone Pieplenbosch
Publication venue: SpringerOpen
Publication date: 01/09/2023
Field of study

Abstract Background Convolutional neural networks (CNNs), applied to baseline [18F]-FDG PET/CT maximum intensity projections (MIPs), show potential for treatment outcome prediction in diffuse large B-cell lymphoma (DLBCL). The aim of this study is to investigate the robustness of CNN predictions to different image reconstruction protocols. Baseline [18F]FDG PET/CT scans were collected from 20 DLBCL patients. EARL1, EARL2 and high-resolution (HR) protocols were applied per scan, generating three images with different image qualities. Image-based transformation was applied by blurring EARL2 and HR images to generate EARL1 compliant images using a Gaussian filter of 5 and 7 mm, respectively. MIPs were generated for each of the reconstructions, before and after image transformation. An in-house developed CNN predicted the probability of tumor progression within 2 years for each MIP. The difference in probabilities per patient was then calculated between both EARL2 and HR with respect to EARL1 (delta probabilities or ΔP). We compared these to the probabilities obtained after aligning the data with ComBat using the difference in median and interquartile range (IQR). Results CNN probabilities were found to be sensitive to different reconstruction protocols (EARL2 ΔP: median = 0.09, interquartile range (IQR) = [0.06, 0.10] and HR ΔP: median = 0.1, IQR = [0.08, 0.16]). Moreover, higher resolution images (EARL2 and HR) led to higher probability values. After image-based and ComBat transformation, an improved agreement of CNN probabilities among reconstructions was found for all patients. This agreement was slightly better after image-based transformation (transformed EARL2 ΔP: median = 0.022, IQR = [0.01, 0.02] and transformed HR ΔP: median = 0.029, IQR = [0.01, 0.03]). Conclusion Our CNN-based outcome predictions are affected by the applied reconstruction protocols, yet in a predictable manner. Image-based harmonization is a suitable approach to harmonize CNN predictions across image reconstruction protocols

Directory of Open Access Journals

Reproducibility of [18F]FDG PET/CT liver SUV as reference or normalisation factor

Author: Boellaard Ronald
Burggraaff Coreline N.
de Vet Henrica C. W.
Eertink Jakoba J.
Ferrández Maria C.
Heymans Martijn W.
Lugtenburg Pieternella J.
Wiegers Sanne E.
Zijlstra Josée M.
Zwezerijnen Gerben J. C.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 27/09/2022
Field of study

Introduction: Although visual and quantitative assessments of [18F]FDG PET/CT studies typically rely on liver uptake value as a reference or normalisation factor, consensus or consistency in measuring [18F]FDG uptake is lacking. Therefore, we evaluate the variation of several liver standardised uptake value (SUV) measurements in lymphoma [18F]FDG PET/CT studies using different uptake metrics. Methods: PET/CT scans from 34 lymphoma patients were used to calculate SUVmaxliver, SUVpeakliver and SUVmeanliver as a function of (1) volume-of-interest (VOI) size, (2) location, (3) imaging time point and (4) as a function of total metabolic tumour volume (MTV). The impact of reconstruction protocol on liver uptake is studied on 15 baseline lymphoma patient scans. The effect of noise on liver SUV was assessed using full and 25% count images of 15 lymphoma scans. Results: Generally, SUVmaxliver and SUVpeakliver were 38% and 16% higher compared to SUVmeanliver. SUVmaxliver and SUVpeakliver increased up to 31% and 15% with VOI size while SUVmeanliver remained unchanged with the lowest variability for the largest VOI size. Liver uptake metrics were not affected by VOI location. Compared to baseline, liver uptake metrics were 15–18% and 9–18% higher at interim and EoT PET, respectively. SUVliver decreased with larger total MTVs. SUVmaxliver and SUVpeakliver were affected by reconstruction protocol up to 62%. SUVmax and SUVpeak moved 22% and 11% upward between full and 25% count images. Conclusion: SUVmeanliver was most robust against VOI size, location, reconstruction protocol and image noise level, and is thus the most reproducible metric for liver uptake. The commonly recommended 3 cm diameter spherical VOI-based SUVmeanliver values were only slightly more variable than those seen with larger VOI sizes and are sufficient for SUVmeanliver measurements in future studies. Trial registration: EudraCT: 2006–005,174-42, 01–08-2008

EUR Research Repository