13 research outputs found
Table_1_Comparison of Resampling Techniques for Imbalanced Datasets in Machine Learning: Application to Epileptogenic Zone Localization From Interictal Intracranial EEG Recordings in Patients With Focal Epilepsy.DOCX
Aim: In neuroscience research, data are quite often characterized by an imbalanced distribution between the majority and minority classes, an issue that can limit or even worsen the prediction performance of machine learning methods. Different resampling procedures have been developed to face this problem and a lot of work has been done in comparing their effectiveness in different scenarios. Notably, the robustness of such techniques has been tested among a wide variety of different datasets, without considering the performance of each specific dataset. In this study, we compare the performances of different resampling procedures for the imbalanced domain in stereo-electroencephalography (SEEG) recordings of the patients with focal epilepsies who underwent surgery.Methods: We considered data obtained by network analysis of interictal SEEG recorded from 10 patients with drug-resistant focal epilepsies, for a supervised classification problem aimed at distinguishing between the epileptogenic and non-epileptogenic brain regions in interictal conditions. We investigated the effectiveness of five oversampling and five undersampling procedures, using 10 different machine learning classifiers. Moreover, six specific ensemble methods for the imbalanced domain were also tested. To compare the performances, Area under the ROC curve (AUC), F-measure, Geometric Mean, and Balanced Accuracy were considered.Results: Both the resampling procedures showed improved performances with respect to the original dataset. The oversampling procedure was found to be more sensitive to the type of classification method employed, with Adaptive Synthetic Sampling (ADASYN) exhibiting the best performances. All the undersampling approaches were more robust than the oversampling among the different classifiers, with Random Undersampling (RUS) exhibiting the best performance despite being the simplest and most basic classification method.Conclusions: The application of machine learning techniques that take into consideration the balance of features by resampling is beneficial and leads to more accurate localization of the epileptogenic zone from interictal periods. In addition, our results highlight the importance of the type of classification method that must be used together with the resampling to maximize the benefit to the outcome.</p
Image_1_Comparison of Resampling Techniques for Imbalanced Datasets in Machine Learning: Application to Epileptogenic Zone Localization From Interictal Intracranial EEG Recordings in Patients With Focal Epilepsy.TIF
Aim: In neuroscience research, data are quite often characterized by an imbalanced distribution between the majority and minority classes, an issue that can limit or even worsen the prediction performance of machine learning methods. Different resampling procedures have been developed to face this problem and a lot of work has been done in comparing their effectiveness in different scenarios. Notably, the robustness of such techniques has been tested among a wide variety of different datasets, without considering the performance of each specific dataset. In this study, we compare the performances of different resampling procedures for the imbalanced domain in stereo-electroencephalography (SEEG) recordings of the patients with focal epilepsies who underwent surgery.Methods: We considered data obtained by network analysis of interictal SEEG recorded from 10 patients with drug-resistant focal epilepsies, for a supervised classification problem aimed at distinguishing between the epileptogenic and non-epileptogenic brain regions in interictal conditions. We investigated the effectiveness of five oversampling and five undersampling procedures, using 10 different machine learning classifiers. Moreover, six specific ensemble methods for the imbalanced domain were also tested. To compare the performances, Area under the ROC curve (AUC), F-measure, Geometric Mean, and Balanced Accuracy were considered.Results: Both the resampling procedures showed improved performances with respect to the original dataset. The oversampling procedure was found to be more sensitive to the type of classification method employed, with Adaptive Synthetic Sampling (ADASYN) exhibiting the best performances. All the undersampling approaches were more robust than the oversampling among the different classifiers, with Random Undersampling (RUS) exhibiting the best performance despite being the simplest and most basic classification method.Conclusions: The application of machine learning techniques that take into consideration the balance of features by resampling is beneficial and leads to more accurate localization of the epileptogenic zone from interictal periods. In addition, our results highlight the importance of the type of classification method that must be used together with the resampling to maximize the benefit to the outcome.</p
SF-PC scores for SWITCH.
Anatomical complexity and data dimensionality present major issues when analysing brain connectivity data. The functional and anatomical aspects of the connections taking place in the brain are in fact equally relevant and strongly intertwined. However, due to theoretical challenges and computational issues, their relationship is often overlooked in neuroscience and clinical research. In this work, we propose to tackle this problem through Smooth Functional Principal Component Analysis, which enables to perform dimensional reduction and exploration of the variability in functional connectivity maps, complying with the formidably complicated anatomy of the grey matter volume. In particular, we analyse a population that includes controls and subjects affected by schizophrenia, starting from fMRI data acquired at rest and during a task-switching paradigm. For both sessions, we first identify the common modes of variation in the entire population. We hence explore whether the subjects’ expressions along these common modes of variation differ between controls and pathological subjects. In each session, we find principal components that are significantly differently expressed in the healthy vs pathological subjects (with p-values </div
SF-PC scores for REST.
Boxplots of scores of the first four SF-PC functions obtained by the proposed SF-PCA, computed on the entire population during REST, and plotted separately for CTRL and SCHZ groups. P-values of Wilcoxon tests that compares the distributions of the PCs scores for CTRL and SCHZ groups.</p
MV-PC for REST.
Anatomical complexity and data dimensionality present major issues when analysing brain connectivity data. The functional and anatomical aspects of the connections taking place in the brain are in fact equally relevant and strongly intertwined. However, due to theoretical challenges and computational issues, their relationship is often overlooked in neuroscience and clinical research. In this work, we propose to tackle this problem through Smooth Functional Principal Component Analysis, which enables to perform dimensional reduction and exploration of the variability in functional connectivity maps, complying with the formidably complicated anatomy of the grey matter volume. In particular, we analyse a population that includes controls and subjects affected by schizophrenia, starting from fMRI data acquired at rest and during a task-switching paradigm. For both sessions, we first identify the common modes of variation in the entire population. We hence explore whether the subjects’ expressions along these common modes of variation differ between controls and pathological subjects. In each session, we find principal components that are significantly differently expressed in the healthy vs pathological subjects (with p-values </div
Mean connectivity maps of REST and SWITCH session.
The first row depicts the mean z-score of the CTRL group for the two sessions, while the second depicts the mean z-score of the SCHZ group.</p
SF-PC functions for REST.
Visualizations of the first four SF-PC functions obtained by the proposed SF-PCA, computed on the entire population during REST, and plotted separately for CTRL and SCHZ groups. Top: median(sCTRL) â‹… SF-PC. Bottom: median(sSCHZ)â‹… SF-PC. The figure highlights the median behaviour on each component, for subjects in the two populations.</p
SF-PC functions for SWITCH.
Anatomical complexity and data dimensionality present major issues when analysing brain connectivity data. The functional and anatomical aspects of the connections taking place in the brain are in fact equally relevant and strongly intertwined. However, due to theoretical challenges and computational issues, their relationship is often overlooked in neuroscience and clinical research. In this work, we propose to tackle this problem through Smooth Functional Principal Component Analysis, which enables to perform dimensional reduction and exploration of the variability in functional connectivity maps, complying with the formidably complicated anatomy of the grey matter volume. In particular, we analyse a population that includes controls and subjects affected by schizophrenia, starting from fMRI data acquired at rest and during a task-switching paradigm. For both sessions, we first identify the common modes of variation in the entire population. We hence explore whether the subjects’ expressions along these common modes of variation differ between controls and pathological subjects. In each session, we find principal components that are significantly differently expressed in the healthy vs pathological subjects (with p-values </div
Figure 4
<p>(A) Changes in extracellular pH <u>(pH<sub>e</sub>)</u> in the PC and in mOT induced by ipsilateral MCA occlusion and reperfusion. Occlusion induced a rapid metabolic acidification of the extracellular microenviroment in PC, interrupted by a transient and mild basification (arrow) associated to the hypoxic spreading depression (HD, asterisk). No changes in extracellular [H<sup>+</sup>] were recorded in the mOT, that is not served by the MCA. (B) Simultaneous changes in extracellular potassium concentration ([K<sup>+</sup>]<sub>o</sub>)and extracellular pH in the PC after MCA occlusion and reperfusion. An initial enhancement in [K<sup>+</sup>] was followed by a fast and large increase in [K<sup>+</sup>]<sub>o</sub>. associated to the HD. The schematic drawing on the right illustrates the position of the two-barrel recording electrodes. The field responses (FP) recorded with the conventional extracellular barrel are also shown. The period of MCA occlusion is marked by the shaded area.</p
Figure 2
<p>A. Recostrution of the distribution of NCs superimposed on a coronal section immunoreacted with anti-MAP-2 antibody (immunoperoxydase staining) to identify the ischemic region. Green fluorescent CFDA-stained stem cells were counted and plotted on the adjacent section stained with the MAP-2 fluorescent antibody (calibration bar = 1 cm). In B and C, microphotographs of green fluorescent CFDA-stained stem cells on sections counterstained with the fluorescent anti-MAP-2 antibody are shown at ×10 (B) and ×40 (C) magnifications. The pictures were taken in the transition region between the MAP-2<sup>+</sup> and the MAP-2<sup>−</sup> areas in the olfactory region. Calibration bars: 150 µm and 16 µm for B and C, respectively.</p