100 research outputs found

    Assessing similarity of feature selection techniques in high-dimensional domains

    Get PDF
    Recent research efforts attempt to combine multiple feature selection techniques instead of using a single one. However, this combination is often made on an “ad hoc” basis, depending on the specific problem at hand, without considering the degree of diversity/similarity of the involved methods. Moreover, though it is recognized that different techniques may return quite dissimilar outputs, especially in high dimensional/small sample size domains, few direct comparisons exist that quantify these differences and their implications on classification performance. This paper aims to provide a contribution in this direction by proposing a general methodology for assessing the similarity between the outputs of different feature selection methods in high dimensional classification problems. Using as benchmark the genomics domain, an empirical study has been conducted to compare some of the most popular feature selection methods, and useful insight has been obtained about their pattern of agreement

    An evolutionary approach for balancing effectiveness and representation level in gene selection

    Get PDF
    As data mining develops and expands to new application areas, feature selection also reveals various aspects to be considered. This paper underlines two aspects that seem to categorize the large body of available feature selection algorithms: the effectiveness and the representation level. The effectiveness deals with selecting the minimum set of variables that maximize the accuracy of a classifier and the representation level concerns discovering how relevant the variables are for the domain of interest. For balancing the above aspects, the paper proposes an evolutionary framework for feature selection that expresses a hybrid method, organized in layers, each of them exploits a specific model of search strategy. Extensive experiments on gene selection from DNA-microarray datasets are presented and discussed. Results indicate that the framework compares well with different hybrid methods proposed in literature as it has the capability of finding well suited subsets of informative features while improving classification accurac

    Convolutional Neural Network for Seizure Detection of Nocturnal Frontal Lobe Epilepsy

    Get PDF
    The Nocturnal Frontal Lobe Epilepsy (NFLE) is a form of epilepsy in which seizures occur predominantly during sleep. In other forms of epilepsy, the commonly used clinical approach mainly involves manual inspection of encephalography (EEG) signals, a laborious and time-consuming process which often requires the contribution of more than one experienced neurologist. In the last decades, numerous approaches to automate this detection have been proposed and, more recently, machine learning has shown very promising performance. In this paper, an original Convolutional Neural Network (CNN) architecture is proposed to develop patient-specific seizure detection models for three patients affected by NFLE. The performances, in terms of accuracy, sensitivity, and specificity, exceed by several percentage points those in the most recent literature. The capability of the patient-specific models has been also tested to compare the obtained seizure onset times with those provided by the neurologists, with encouraging results. Moreover, the same CNN architecture has been used to develop a cross-patient seizure detection system, resorting to the transfer-learning paradigm. Starting from a patient-specific model, few data from a new patient are enough to customize his model. This contribution aims to alleviate the task of neurologists, who may have a robust indication to corroborate their clinical conclusions

    Performance Comparison of Machine Learning Disruption Predictors at JET

    Get PDF
    Reliable disruption prediction (DP) and disruption mitigation systems are considered unavoidable during international thermonuclear experimental reactor (ITER) operations and in the view of the next fusion reactors such as the DEMOnstration Power Plant (DEMO) and China Fusion Engineering Test Reactor (CFETR). In the last two decades, a great number of DP systems have been developed using data-driven methods. The performance of the DP models has been improved over the years both for a more appropriate choice of diagnostics and input features and for the availability of increasingly powerful data-driven modelling techniques. However, a direct comparison among the proposals has not yet been conducted. Such a comparison is mandatory, at least for the same device, to learn lessons from all these efforts and finally choose the best set of diagnostic signals and the best modelling approach. A first effort towards this goal is made in this paper, where different DP models will be compared using the same performance indices and the same device. In particular, the performance of a conventional Multilayer Perceptron Neural Network (MLP-NN) model is compared with those of two more sophisticated models, based on Generative Topographic Mapping (GTM) and Convolutional Neural Networks (CNN), on the same real time diagnostic signals from several experiments at the JET tokamak. The most common performance indices have been used to compare the different DP models and the results are deeply discussed. The comparison confirms the soundness of all the investigated machine learning approaches and the chosen diagnostics, enables us to highlight the pros and cons of each model, and helps to consciously choose the approach that best matches with the plasma protection needs

    Decision trees to evaluate the risk of developing multiple sclerosis

    Get PDF
    Introduction: Multiple sclerosis (MS) is a persistent neurological condition impacting the central nervous system (CNS). The precise cause of multiple sclerosis is still uncertain; however, it is thought to arise from a blend of genetic and environmental factors. MS diagnosis includes assessing medical history, conducting neurological exams, performing magnetic resonance imaging (MRI) scans, and analyzing cerebrospinal fluid. While there is currently no cure for MS, numerous treatments exist to address symptoms, decelerate disease progression, and enhance the quality of life for individuals with MS. Methods: This paper introduces a novel machine learning (ML) algorithm utilizing decision trees to address a key objective: creating a predictive tool for assessing the likelihood of MS development. It achieves this by combining prevalent demographic risk factors, specifically gender, with crucial immunogenetic risk markers, such as the alleles responsible for human leukocyte antigen (HLA) class I molecules and the killer immunoglobulin-like receptors (KIR) genes responsible for natural killer lymphocyte receptors. Results: The study included 619 healthy controls and 299 patients affected by MS, all of whom originated from Sardinia. The gender feature has been disregarded due to its substantial bias in influencing the classification outcomes. By solely considering immunogenetic risk markers, the algorithm demonstrates an ability to accurately identify 73.24% of MS patients and 66.07% of individuals without the disease. Discussion: Given its notable performance, this system has the potential to support clinicians in monitoring the relatives of MS patients and identifying individuals who are at an increased risk of developing the disease

    Real-Time Detection of Overloads on the Plasma-Facing Components of Wendelstein 7-X

    Get PDF
    Wendelstein 7-X (W7-X) is the leading experiment on the path of demonstrating that stellarators are a feasible concept for a future power plant. One of its major goals is to prove quasi-steady-state operation in a reactor-relevant parameter regime. The surveillance and protection of the water-cooled plasma-facing components (PFCs) against overheating is fundamental to guarantee a safe steady-state high-heat-flux operation. The system has to detect thermal events in real-time and timely interrupt operation if it detects a critical event. The fast reaction times required to prevent damage to the device make it imperative to automate fully the image analysis algorithms. During the past operational phases, W7-X was equipped with inertially cooled test divertor units and the system still required manual supervision. With the experience gained, we have designed a new real-time PFC protection system based on image processing techniques. It uses a precise registration of the entire field of view against the CAD model to determine the temperature limits and thermal properties of the different PFCs. Instead of reacting when the temperature limits are breached in certain regions of interest, the system predicts when an overload will occur based on a heat flux estimation, triggering the interlock system in advance to compensate for the system delay. To conclude, we present our research roadmap towards a feedback control system of thermal loads to prevent unnecessary plasma interruptions in long high-performance plasmas

    Synthesis, Structural investigation and luminescence spectroscopy of nanocrystalline Gd3Ga5O12 Doped with lanthanide ions

    Get PDF
    Gadolinium gallium garnet (GGG) nanocrystalline powder doped with. lanthanide ions (Eu3+ and Er3+) have been obtained using two different methods (coprecipitation and Pechini). The X-ray diffraction results show that single phase cubic GGG nanopowders have been obtained for both preparation methods. The samples prepared by the two procedures show different morphologies, as revealed by scanning electron microscopy images. The Er3+-doped nanopowders obtained with the coprecipitation method show strong luminescence upon 488.0 nm excitation. The emission spectrum is similar to the one of the single crystal and of nanopowders of the same composition prepared by a combustion synthesis. The Er3+-doped GGG nanopowders obtained by the coprecipitation method show efficient upconversion in the green region (around 550 nm) upon excitation in the near IR at a wavelength of 800 nm

    Decision trees to evaluate the risk of developing multiple sclerosis

    Get PDF
    IntroductionMultiple sclerosis (MS) is a persistent neurological condition impacting the central nervous system (CNS). The precise cause of multiple sclerosis is still uncertain; however, it is thought to arise from a blend of genetic and environmental factors. MS diagnosis includes assessing medical history, conducting neurological exams, performing magnetic resonance imaging (MRI) scans, and analyzing cerebrospinal fluid. While there is currently no cure for MS, numerous treatments exist to address symptoms, decelerate disease progression, and enhance the quality of life for individuals with MS.MethodsThis paper introduces a novel machine learning (ML) algorithm utilizing decision trees to address a key objective: creating a predictive tool for assessing the likelihood of MS development. It achieves this by combining prevalent demographic risk factors, specifically gender, with crucial immunogenetic risk markers, such as the alleles responsible for human leukocyte antigen (HLA) class I molecules and the killer immunoglobulin-like receptors (KIR) genes responsible for natural killer lymphocyte receptors.ResultsThe study included 619 healthy controls and 299 patients affected by MS, all of whom originated from Sardinia. The gender feature has been disregarded due to its substantial bias in influencing the classification outcomes. By solely considering immunogenetic risk markers, the algorithm demonstrates an ability to accurately identify 73.24% of MS patients and 66.07% of individuals without the disease.DiscussionGiven its notable performance, this system has the potential to support clinicians in monitoring the relatives of MS patients and identifying individuals who are at an increased risk of developing the disease

    Genome-wide Analyses Identify KIF5A as a Novel ALS Gene

    Get PDF
    To identify novel genes associated with ALS, we undertook two lines of investigation. We carried out a genome-wide association study comparing 20,806 ALS cases and 59,804 controls. Independently, we performed a rare variant burden analysis comparing 1,138 index familial ALS cases and 19,494 controls. Through both approaches, we identified kinesin family member 5A (KIF5A) as a novel gene associated with ALS. Interestingly, mutations predominantly in the N-terminal motor domain of KIF5A are causative for two neurodegenerative diseases: hereditary spastic paraplegia (SPG10) and Charcot-Marie-Tooth type 2 (CMT2). In contrast, ALS-associated mutations are primarily located at the C-terminal cargo-binding tail domain and patients harboring loss-of-function mutations displayed an extended survival relative to typical ALS cases. Taken together, these results broaden the phenotype spectrum resulting from mutations in KIF5A and strengthen the role of cytoskeletal defects in the pathogenesis of ALS.Peer reviewe
    • …
    corecore