6,750 research outputs found
Automatic epilepsy detection using fractal dimensions segmentation and GP-SVM classification
Objective: The most important part of signal processing for classification is feature extraction as a mapping from original input electroencephalographic (EEG) data space to new features space with the biggest class separability value. Features are not only the most important, but also the most difficult task from the classification process as they define input data and classification quality. An ideal set of features would make the classification problem trivial. This article presents novel methods of feature extraction processing and automatic epilepsy seizure classification combining machine learning methods with genetic evolution algorithms.
Methods: Classification is performed on EEG data that represent electric brain activity. At first, the signal is preprocessed with digital filtration and adaptive segmentation using fractal dimensions as the only segmentation measure. In the next step, a novel method using genetic programming (GP) combined with support vector machine (SVM) confusion matrix as fitness function weight is used to extract feature vectors compressed into lower dimension space and classify the final result into ictal or interictal epochs.
Results: The final application of GP SVM method improves the discriminatory performance of a classifier by reducing feature dimensionality at the same time. Members of the GP tree structure represent the features themselves and their number is automatically decided by the compression function introduced in this paper. This novel method improves the overall performance of the SVM classification by dramatically reducing the size of input feature vector.
Conclusion: According to results, the accuracy of this algorithm is very high and comparable, or even superior to other automatic detection algorithms. In combination with the great efficiency, this algorithm can be used in real-time epilepsy detection applications. From the results of the algorithm's classification, we can observe high sensitivity, specificity results, except for the Generalized Tonic Clonic Seizure (GTCS). As the next step, the optimization of the compression stage and final SVM evaluation stage is in place. More data need to be obtained on GTCS to improve the overall classification score for GTCS.Web of Science142449243
SIGNAL PROCESSING FOR RAMAN SPECTRA FOR DISEASE DETECTION
Raman Spectroscopy enables in-depth study into the molecular structure of solid, liquid and gasses from its scattering spectrum. As such, the spectrum could offer a biochemical fingerprint to identify unknown molecules. Surface Enhanced Raman Spectroscopy (SERS) amplifies the weak Raman signal by 10+3 to 10+7 times, revolutionary making the method appealing to the research community. SERS has been proven useful for disease detection from a medium such as a cell, serum, urine, plasma, saliva, tears. The spectra displayed are noisy and complicated by the presence of other molecules, besides the targeted one. Moreover, the difference between the infected and controlled samples is far too minute for detection by the naked human eyes. Hence, signal processing techniques are found crucial to single out fingerprint of the target molecule from biological spectra. Our work here examines signal processing techniques attempted on SERS spectra for disease detection, such as Principal Component Analysis (PCA), Linear Discriminant Analysis (LDA), Artificial Neural Network (ANN), Support Vector Machine (SVM) and Logistic Regression Analysis (LRA). It is found that PCA-LDA is the most popular (45%), ensued by PCA-ANN (33%) and SVM (22%). PCA-SVM yields the highest in accuracy (99.9%), followed by PCA-ANN (98%) and LRA (97%). PCA-LDA and SVM score the highest in both sensitivity-specificity.Keywords: Raman Spectra, Surface Enhanced Raman Spectroscopy (SERS), Neural Network (NN), Support Vector Machine (SVM), Logistic Regression Analysis (LRA), Principal Component Analysis (PCA), Linear Discriminant Analysis (LDA)
Spectrochemical analysis in blood plasma combined with subsequent chemometrics for fibromyalgia detection
Fibromyalgia is a rheumatologic condition characterized by multiple and chronic body pain, and other typical symptoms such as intense fatigue, anxiety and depression. It is a very complex disease where treatment is often made by non-medicated alternatives in order to alleviate symptoms and improve the patient’s quality of life. Herein, we propose a method to detect patients with fibromyalgia (n = 252, 126 controls and 126 patients with fibromyalgia) through the analysis of their blood plasma using attenuated total reflection Fourier-transform infrared (ATR-FTIR) spectroscopy in conjunction with chemometric techniques, hence, providing a low-cost, fast and accurate diagnostic approach. Different chemometric algorithms were tested to classify the spectral data; genetic algorithm with linear discriminant analysis (GA-LDA) achieved the best diagnostic results with a sensitivity of 89.5% in an external test set. The GA-LDA model identified 24 spectral wavenumbers responsible for class separation; amongst these, the Amide II (1,545 cm−1) and proteins (1,425 cm−1) were identified to be discriminant features. These results reinforce the potential of ATR-FTIR spectroscopy with multivariate analysis as a new tool to screen and detect patients with fibromyalgia in a fast, low-cost, non-destructive and minimally invasive fashion
Automatic vocalisation-based detection of fragile X syndrome and Rett syndrome
Fragile X syndrome (FXS) and Rett syndrome (RTT) are developmental disorders currently not diagnosed before toddlerhood. Even though speech-language deficits are among the key symptoms of both conditions, little is known about infant vocalisation acoustics for an automatic earlier identification of affected individuals. To bridge this gap, we applied intelligent audio analysis methodology to a compact dataset of 4454 home-recorded vocalisations of 3 individuals with FXS and 3 individuals with RTT aged 6 to 11 months, as well as 6 age- and gender-matched typically developing controls (TD). On the basis of a standardised set of 88 acoustic features, we trained linear kernel support vector machines to evaluate the feasibility of automatic classification of (a) FXS vs TD, (b) RTT vs TD, (c) atypical development (FXS+RTT) vs TD, and (d) FXS vs RTT vs TD. In paradigms (a)–(c), all infants were correctly classified; in paradigm (d), 9 of 12 were so. Spectral/cepstral and energy-related features were most relevant for classification across all paradigms. Despite the small sample size, this study reveals new insights into early vocalisation characteristics in FXS and RTT, and provides technical underpinnings for a future earlier identification of affected individuals, enabling earlier intervention and family counselling
Detection of severe obstructive sleep apnea through voice analysis
tThis paper deals with the potential and limitations of using voice and speech processing to detect Obstruc-tive Sleep Apnea (OSA). An extensive body of voice features has been extracted from patients whopresent various degrees of OSA as well as healthy controls. We analyse the utility of a reduced set offeatures for detecting OSA. We apply various feature selection and reduction schemes (statistical rank-ing, Genetic Algorithms, PCA, LDA) and compare various classifiers (Bayesian Classifiers, kNN, SupportVector Machines, neural networks, Adaboost). S-fold crossvalidation performed on 248 subjects showsthat in the extreme cases (that is, 127 controls and 121 patients with severe OSA) voice alone is able todiscriminate quite well between the presence and absence of OSA. However, this is not the case withmild OSA and healthy snoring patients where voice seems to play a secondary role. We found that thebest classification schemes are achieved using a Genetic Algorithm for feature selection/reduction
Grey-matter texture abnormalities and reduced hippocampal volume are distinguishing features of schizophrenia
Neurodevelopmental processes are widely believed to underlie schizophrenia. Analysis of
brain texture from conventional magnetic resonance imaging (MRI) can detect disturbance in
brain cytoarchitecture. We tested the hypothesis that patients with schizophrenia manifest
quantitative differences in brain texture that, alongside discrete volumetric changes, may
serve as an endophenotypic biomarker. Texture analysis (TA) of grey matter distribution and
voxel-based morphometry (VBM) of regional brain volumes were applied to MRI scans of 27
patients with schizophrenia and 24 controls. Texture parameters (uniformity and entropy)
were also used as covariates in VBM analyses to test for correspondence with regional brain
volume. Linear discriminant analysis tested if texture and volumetric data predicted
diagnostic group membership (schizophrenia or control). We found that uniformity and
entropy of grey matter differed significantly between individuals with schizophrenia and
controls at the fine spatial scale (filter width below 2 mm). Within the schizophrenia group,
these texture parameters correlated with volumes of the left hippocampus, right amygdala
and cerebellum. The best predictor of diagnostic group membership was the combination of
fine texture heterogeneity and left hippocampal size. This study highlights the presence of
distributed grey-matter abnormalities in schizophrenia, and their relation to focal structural
abnormality of the hippocampus. The conjunction of these features has potential as a
neuroimaging endophenotype of schizophrenia
Rank discriminants for predicting phenotypes from RNA expression
Statistical methods for analyzing large-scale biomolecular data are
commonplace in computational biology. A notable example is phenotype prediction
from gene expression data, for instance, detecting human cancers,
differentiating subtypes and predicting clinical outcomes. Still, clinical
applications remain scarce. One reason is that the complexity of the decision
rules that emerge from standard statistical learning impedes biological
understanding, in particular, any mechanistic interpretation. Here we explore
decision rules for binary classification utilizing only the ordering of
expression among several genes; the basic building blocks are then two-gene
expression comparisons. The simplest example, just one comparison, is the TSP
classifier, which has appeared in a variety of cancer-related discovery
studies. Decision rules based on multiple comparisons can better accommodate
class heterogeneity, and thereby increase accuracy, and might provide a link
with biological mechanism. We consider a general framework ("rank-in-context")
for designing discriminant functions, including a data-driven selection of the
number and identity of the genes in the support ("context"). We then specialize
to two examples: voting among several pairs and comparing the median expression
in two groups of genes. Comprehensive experiments assess accuracy relative to
other, more complex, methods, and reinforce earlier observations that simple
classifiers are competitive.Comment: Published in at http://dx.doi.org/10.1214/14-AOAS738 the Annals of
Applied Statistics (http://www.imstat.org/aoas/) by the Institute of
Mathematical Statistics (http://www.imstat.org
- …