Search CORE

19 research outputs found

Confound-leakage: confound removal in machine learning leads to leakage

Author: Eickhoff SB
Hamdan S
Love BC
Patil KR
Schwender H
von Polier GG
Weis S
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2023
Field of study

BACKGROUND: Machine learning (ML) approaches are a crucial component of modern data analysis in many fields, including epidemiology and medicine. Nonlinear ML methods often achieve accurate predictions, for instance, in personalized medicine, as they are capable of modeling complex relationships between features and the target. Problematically, ML models and their predictions can be biased by confounding information present in the features. To remove this spurious signal, researchers often employ featurewise linear confound regression (CR). While this is considered a standard approach for dealing with confounding, possible pitfalls of using CR in ML pipelines are not fully understood. RESULTS: We provide new evidence that, contrary to general expectations, linear confound regression can increase the risk of confounding when combined with nonlinear ML approaches. Using a simple framework that uses the target as a confound, we show that information leaked via CR can increase null or moderate effects to near-perfect prediction. By shuffling the features, we provide evidence that this increase is indeed due to confound-leakage and not due to revealing of information. We then demonstrate the danger of confound-leakage in a real-world clinical application where the accuracy of predicting attention-deficit/hyperactivity disorder is overestimated using speech-derived features when using depression as a confound. CONCLUSIONS: Mishandling or even amplifying confounding effects when building ML models due to confound-leakage, as shown, can lead to untrustworthy, biased, and unfair predictions. Our expose of the confound-leakage pitfall and provided guidelines for dealing with it can help create more robust and trustworthy ML models

UCL Discovery

Evidence for similar structural brain anomalies in youth and adult attention-deficit/hyperactivity disorder: a machine learning analysis

Author: Ambrosino S
Asherson P
Banaschewski T
Baranov A
Baumeister S
Baur-Streubel R
Bellgrove MA
Biederman J
Bralten J
Bramati IE
Brandeis D
Brem S
Buitelaar JK
Busatto GF
Calvo A
Castellanos FX
Cercignani M
Chaim-Avancini TM
Chantiluke KC
Christakou A
Coghill D
Conzelmann A
Cubillo AI
Dale AM
de Zeeuw P
Doyle AE
Durston S
Earl EA
Epstein JN
Ethofer T
Fair DA
Fallgatter AJ
Frodl T
Gabel MC
Gogberashvili T
Haavik J
Harrison NA
Hartman CA
Helminen EC
Heslenfeld DJ
Hoekstra PJ
Hohmann S
Høvik MF
Jahanshad N
Jernigan TL
Kardatzki B
Karkashadze G
Kelly C
Kohls G
Konrad K
Kuntsi J
Lazaro L
Lera-Miguel S
Lesch KP
Liu J
Louza MR
Lundervold AJ
Malpas CB
Mattos P
McCarthy H
Mehta MA
Namazova-Baranova L
Nicolau R
Nigg JT
Novotny SE
Oosterlaan J
Oranje B
O’Gorman Tuura RL
Paloyelis Y
Pauli P
Plessen KJ
Ramos-Quiroga JA
Reif A
Reneman L
Rosa PGP
Rubia K
Schrantee A
Schulte-Rutte M
Schwarz L
Schweren LJS
Seitz J
Shaw P
Silk Timothy
Skokauskas N
Stevens MC
Sudre G
Tamm L
Thompson PM
Tovar-Moll F
van Erp TGM
Vance A
Vila JCS
Vilarroya O
Vives-Gilabert Y
von Polier GG
Walitza S
Weiss EO
Yoncheva YN
Zanetti MV
Zhang-James Y
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2021
Field of study

Attention-deficit/hyperactivity disorder (ADHD) affects 5% of children world-wide. Of these, two-thirds continue to have impairing symptoms of ADHD into adulthood. Although a large literature implicates structural brain differences of the disorder, it is not clear if adults with ADHD have similar neuroanatomical differences as those seen in children with recent reports from the large ENIGMA-ADHD consortium finding structural differences for children but not for adults. This paper uses deep learning neural network classification models to determine if there are neuroanatomical changes in the brains of children with ADHD that are also observed for adult ADHD, and vice versa. We found that structural MRI data can significantly separate ADHD from control participants for both children and adults. Consistent with the prior reports from ENIGMA-ADHD, prediction performance and effect sizes were better for the child than the adult samples. The model trained on adult samples significantly predicted ADHD in the child sample, suggesting that our model learned anatomical features that are common to ADHD in childhood and adulthood. These results support the continuity of ADHD’s brain differences from childhood to adulthood. In addition, our work demonstrates a novel use of neural network classification models to test hypotheses about developmental continuity

University of Groningen

Serveur académique lausannois

Juelich Shared Electronic Resources

UPF Digital Repository

Sussex Research Online

Repository for Publications and Research Data

Central Archive at the University of Reading

Deakin Research Online

Proceedings - University of Groningen

Online Research @ Cardiff

ARTS repository - University of Groningen

eScholarship - University of California