126 research outputs found

    Deepbet: Fast brain extraction of T1-weighted MRI using Convolutional Neural Networks

    Full text link
    Brain extraction in magnetic resonance imaging (MRI) data is an important segmentation step in many neuroimaging preprocessing pipelines. Image segmentation is one of the research fields in which deep learning had the biggest impact in recent years enabling high precision segmentation with minimal compute. Consequently, traditional brain extraction methods are now being replaced by deep learning-based methods. Here, we used a unique dataset comprising 568 T1-weighted (T1w) MR images from 191 different studies in combination with cutting edge deep learning methods to build a fast, high-precision brain extraction tool called deepbet. deepbet uses LinkNet, a modern UNet architecture, in a two stage prediction process. This increases its segmentation performance, setting a novel state-of-the-art performance during cross-validation with a median Dice score (DSC) of 99.0% on unseen datasets, outperforming current state of the art models (DSC = 97.8% and DSC = 97.9%). While current methods are more sensitive to outliers, resulting in Dice scores as low as 76.5%, deepbet manages to achieve a Dice score of > 96.9% for all samples. Finally, our model accelerates brain extraction by a factor of ~10 compared to current methods, enabling the processing of one image in ~2 seconds on low level hardware

    Canonical Correlation Analysis and Partial Least Squares for identifying brain-behaviour associations: a tutorial and a comparative study

    Get PDF
    Canonical Correlation Analysis (CCA) and Partial Least Squares (PLS) are powerful multivariate methods for capturing associations across two modalities of data (e.g., brain and behaviour). However, when the sample size is similar or smaller than the number of variables in the data, CCA and PLS models may overfit, i.e., find spurious associations that generalise poorly to new data. Dimensionality reduction and regularized extensions of CCA and PLS have been proposed to address this problem, yet most studies using these approaches have some limitations. This work gives a theoretical and practical introduction into the most common CCA/PLS models and their regularized variants. We examine the limitations of standard CCA and PLS when the sample size is similar or smaller than the number of variables. We discuss how dimensionality reduction and regularization techniques address this problem and explain their main advantages and disadvantages. We highlight crucial aspects of the CCA/PLS analysis framework, including optimising the hyperparameters of the model and testing the identified associations for statistical significance. We apply the described CCA/PLS models to simulated data and real data from the Human Connectome Project and the Alzheimer's Disease Neuroimaging Initiative (both of n>500). We use both low and high dimensionality versions of each data (i.e., ratios between sample size and variables in the range of ∼1-10 and ∼0.1-0.01) to demonstrate the impact of data dimensionality on the models. Finally, we summarize the key lessons of the tutorial

    Systematic Overestimation of Machine Learning Performance in Neuroimaging Studies of Depression

    Get PDF
    We currently observe a disconcerting phenomenon in machine learning studies in psychiatry: While we would expect larger samples to yield better results due to the availability of more data, larger machine learning studies consistently show much weaker performance than the numerous small-scale studies. Here, we systematically investigated this effect focusing on one of the most heavily studied questions in the field, namely the classification of patients suffering from Major Depressive Disorder (MDD) and healthy controls. Drawing upon a balanced sample of N=1,868N = 1,868 MDD patients and healthy controls from our recent international Predictive Analytics Competition (PAC), we first trained and tested a classification model on the full dataset which yielded an accuracy of 61%. Next, we mimicked the process by which researchers would draw samples of various sizes (N=4N=4 to N=150N=150) from the population and showed a strong risk of overestimation. Specifically, for small sample sizes (N=20N=20), we observe accuracies of up to 95%. For medium sample sizes (N=100N=100) accuracies up to 75% were found. Importantly, further investigation showed that sufficiently large test sets effectively protect against performance overestimation whereas larger datasets per se do not. While these results question the validity of a substantial part of the current literature, we outline the relatively low-cost remedy of larger test sets

    Identification of transdiagnostic psychiatric disorder subtypes using unsupervised learning

    Get PDF
    Psychiatric disorders show heterogeneous symptoms and trajectories, with current nosology not accurately reflecting their molecular etiology and the variability and symptomatic overlap within and between diagnostic classes. This heterogeneity impedes timely and targeted treatment. Our study aimed to identify psychiatric patient clusters that share clinical and genetic features and may profit from similar therapies. We used high-dimensional data clustering on deep clinical data to identify transdiagnostic groups in a discovery sample (N = 1250) of healthy controls and patients diagnosed with depression, bipolar disorder, schizophrenia, schizoaffective disorder, and other psychiatric disorders. We observed five diagnostically mixed clusters and ordered them based on severity. The least impaired cluster 0, containing most healthy controls, showed general well-being. Clusters 1-3 differed predominantly regarding levels of maltreatment, depression, daily functioning, and parental bonding. Cluster 4 contained most patients diagnosed with psychotic disorders and exhibited the highest severity in many dimensions, including medication load. Depressed patients were present in all clusters, indicating that we captured different disease stages or subtypes. We replicated all but the smallest cluster 1 in an independent sample (N = 622). Next, we analyzed genetic differences between clusters using polygenic scores (PGS) and the psychiatric family history. These genetic variables differed mainly between clusters 0 and 4 (prediction area under the receiver operating characteristic curve (AUC) = 81%;significant PGS: cross-disorder psychiatric risk, schizophrenia, and educational attainment). Our results confirm that psychiatric disorders consist of heterogeneous subtypes sharing molecular factors and symptoms. The identification of transdiagnostic clusters advances our understanding of the heterogeneity of psychiatric disorders and may support the development of personalized treatments

    GateNet: A novel Neural Network Architecture for Automated Flow Cytometry Gating

    Full text link
    Flow cytometry is widely used to identify cell populations in patient-derived fluids such as peripheral blood (PB) or cerebrospinal fluid (CSF). While ubiquitous in research and clinical practice, flow cytometry requires gating, i.e. cell type identification which requires labor-intensive and error-prone manual adjustments. To facilitate this process, we designed GateNet, the first neural network architecture enabling full end-to-end automated gating without the need to correct for batch effects. We train GateNet with over 8,000,000 events based on N=127 PB and CSF samples which were manually labeled independently by four experts. We show that for novel, unseen samples, GateNet achieves human-level performance (F1 score ranging from 0.910 to 0.997). In addition we apply GateNet to a publicly available dataset confirming generalization with an F1 score of 0.936. As our implementation utilizes graphics processing units (GPU), gating only needs 15 microseconds per event. Importantly, we also show that GateNet only requires ~10 samples to reach human-level performance, rendering it widely applicable in all domains of flow cytometry

    The AUXIN BINDING PROTEIN 1 Is Required for Differential Auxin Responses Mediating Root Growth

    Get PDF
    Background In plants, the phytohormone auxin is a crucial regulator sustaining growth and development. At the cellular level, auxin is interpreted differentially in a tissue- and dose-dependent manner. Mechanisms of auxin signalling are partially unknown and the contribution of the AUXIN BINDING PROTEIN 1 (ABP1) as an auxin receptor is still a matter of debate. Methodology/Principal Findings Here we took advantage of the present knowledge of the root biological system to demonstrate that ABP1 is required for auxin response. The use of conditional ABP1 defective plants reveals that the protein is essential for maintenance of the root meristem and acts at least on the D-type CYCLIN/RETINOBLASTOMA pathway to control entry into the cell cycle. ABP1 affects PLETHORA gradients and confers auxin sensitivity to root cells thus defining the competence of the cells to be maintained within the meristem or to elongate. ABP1 is also implicated in the regulation of gene expression in response to auxin. Conclusions/Significance Our data support that ABP1 is a key regulator for root growth and is required for auxin-mediated responses. Differential effects of ABP1 on various auxin responses support a model in which ABP1 is the major regulator for auxin action on the cell cycle and regulates auxin-mediated gene expression and cell elongation in addition to the already well known TIR1-mediated ubiquitination pathway

    Principal component analysis as an efficient method for capturing multivariate brain signatures of complex disorders—ENIGMA study in people with bipolar disorders and obesity

    Get PDF
    Multivariate techniques better fit the anatomy of complex neuropsychiatric disorders which are characterized not by alterations in a single region, but rather by variations across distributed brain networks. Here, we used principal component analysis (PCA) to identify patterns of covariance across brain regions and relate them to clinical and demographic variables in a large generalizable dataset of individuals with bipolar disorders and controls. We then compared performance of PCA and clustering on identical sample to identify which methodology was better in capturing links between brain and clinical measures. Using data from the ENIGMA-BD working group, we investigated T1-weighted structural MRI data from 2436 participants with BD and healthy controls, and applied PCA to cortical thickness and surface area measures. We then studied the association of principal components with clinical and demographic variables using mixed regression models. We compared the PCA model with our prior clustering analyses of the same data and also tested it in a replication sample of 327 participants with BD or schizophrenia and healthy controls. The first principal component, which indexed a greater cortical thickness across all 68 cortical regions, was negatively associated with BD, BMI, antipsychotic medications, and age and was positively associated with Li treatment. PCA demonstrated superior goodness of fit to clustering when predicting diagnosis and BMI. Moreover, applying the PCA model to the replication sample yielded significant differences in cortical thickness between healthy controls and individuals with BD or schizophrenia. Cortical thickness in the same widespread regional network as determined by PCA was negatively associated with different clinical and demographic variables, including diagnosis, age, BMI, and treatment with antipsychotic medications or lithium. PCA outperformed clustering and provided an easy-to-use and interpret method to study multivariate associations between brain structure and system-level variables. Practitioner Points: In this study of 2770 Individuals, we confirmed that cortical thickness in widespread regional networks as determined by principal component analysis (PCA) was negatively associated with relevant clinical and demographic variables, including diagnosis, age, BMI, and treatment with antipsychotic medications or lithium. Significant associations of many different system-level variables with the same brain network suggest a lack of one-to-one mapping of individual clinical and demographic factors to specific patterns of brain changes. PCA outperformed clustering analysis in the same data set when predicting group or BMI, providing a superior method for studying multivariate associations between brain structure and system-level variables.</p

    Principal component analysis as an efficient method for capturing multivariate brain signatures of complex disorders—ENIGMA study in people with bipolar disorders and obesity

    Get PDF
    Multivariate techniques better fit the anatomy of complex neuropsychiatric disorders which are characterized not by alterations in a single region, but rather by variations across distributed brain networks. Here, we used principal component analysis (PCA) to identify patterns of covariance across brain regions and relate them to clinical and demographic variables in a large generalizable dataset of individuals with bipolar disorders and controls. We then compared performance of PCA and clustering on identical sample to identify which methodology was better in capturing links between brain and clinical measures. Using data from the ENIGMA-BD working group, we investigated T1-weighted structural MRI data from 2436 participants with BD and healthy controls, and applied PCA to cortical thickness and surface area measures. We then studied the association of principal components with clinical and demographic variables using mixed regression models. We compared the PCA model with our prior clustering analyses of the same data and also tested it in a replication sample of 327 participants with BD or schizophrenia and healthy controls. The first principal component, which indexed a greater cortical thickness across all 68 cortical regions, was negatively associated with BD, BMI, antipsychotic medications, and age and was positively associated with Li treatment. PCA demonstrated superior goodness of fit to clustering when predicting diagnosis and BMI. Moreover, applying the PCA model to the replication sample yielded significant differences in cortical thickness between healthy controls and individuals with BD or schizophrenia. Cortical thickness in the same widespread regional network as determined by PCA was negatively associated with different clinical and demographic variables, including diagnosis, age, BMI, and treatment with antipsychotic medications or lithium. PCA outperformed clustering and provided an easy-to-use and interpret method to study multivariate associations between brain structure and system-level variables. Practitioner Points: In this study of 2770 Individuals, we confirmed that cortical thickness in widespread regional networks as determined by principal component analysis (PCA) was negatively associated with relevant clinical and demographic variables, including diagnosis, age, BMI, and treatment with antipsychotic medications or lithium. Significant associations of many different system-level variables with the same brain network suggest a lack of one-to-one mapping of individual clinical and demographic factors to specific patterns of brain changes. PCA outperformed clustering analysis in the same data set when predicting group or BMI, providing a superior method for studying multivariate associations between brain structure and system-level variables.</p

    High levels of microRNA-21 in the stroma of colorectal cancers predict short disease-free survival in stage II colon cancer patients

    Get PDF
    Approximately 25% of all patients with stage II colorectal cancer will experience recurrent disease and subsequently die within 5 years. MicroRNA-21 (miR-21) is upregulated in several cancer types and has been associated with survival in colon cancer. In the present study we developed a robust in situ hybridization assay using high-affinity Locked Nucleic Acid (LNA) probes that specifically detect miR-21 in formalin-fixed paraffin embedded (FFPE) tissue samples. The expression of miR-21 was analyzed by in situ hybridization on 130 stage II colon and 67 stage II rectal cancer specimens. The miR-21 signal was revealed as a blue chromogenic reaction, predominantly observed in fibroblast-like cells located in the stromal compartment of the tumors. The expression levels were measured using image analysis. The miR-21 signal was determined as the total blue area (TB), or the area fraction relative to the nuclear density (TBR) obtained using a red nuclear stain. High TBR (and TB) estimates of miR-21 expression correlated significantly with shorter disease-free survival (p = 0.004, HR = 1.28, 95% CI: 1.06–1.55) in the stage II colon cancer patient group, whereas no significant correlation with disease-free survival was observed in the stage II rectal cancer group. In multivariate analysis both TB and TBR estimates were independent of other clinical parameters (age, gender, total leukocyte count, K-RAS mutational status and MSI). We conclude that miR-21 is primarily a stromal microRNA, which when measured by image analysis identifies a subgroup of stage II colon cancer patients with short disease-free survival
    corecore