43 research outputs found

    Almost Separable Data Aggregation by Layers of Formal Neurons

    Get PDF
    Information extraction or knowledge discovery from large data sets should be linked to data aggregation process. Data aggregation process can result in a new data representation with decreased number of objects of a given set. A deterministic approach to separable data aggregation means a lesser number of objects without mixing of objects from different categories. A statistical approach is less restrictive and allows for almost separable data aggregation with a low level of mixing of objects from different categories. Layers of formal neurons can be designed for the purpose of data aggregation both in the case of deterministic and statistical approach. The proposed designing method is based on minimization of the of the convex and piecewise linear (CPL) criterion functions

    Computing on Vertices in Data Mining

    Get PDF
    The main challenges in data mining are related to large, multi-dimensional data sets. There is a need to develop algorithms that are precise and efficient enough to deal with big data problems. The Simplex algorithm from linear programming can be seen as an example of a successful big data problem solving tool. According to the fundamental theorem of linear programming the solution of the optimization problem can found in one of the vertices in the parameter space. The basis exchange algorithms also search for the optimal solution among finite number of the vertices in the parameter space. Basis exchange algorithms enable the design of complex layers of classifiers or predictive models based on a small number of multivariate data vectors

    Selected Works in Bioinformatics

    Get PDF
    This book consists of nine chapters covering a variety of bioinformatics subjects, ranging from database resources for protein allergens, unravelling genetic determinants of complex disorders, characterization and prediction of regulatory motifs, computational methods for identifying the best classifiers and key disease genes in large-scale transcriptomic and proteomic experiments, functional characterization of inherently unfolded proteins/regions, protein interaction networks and flexible protein-protein docking. The computational algorithms are in general presented in a way that is accessible to advanced undergraduate students, graduate students and researchers in molecular biology and genetics. The book should also serve as stepping stones for mathematicians, biostatisticians, and computational scientists to cross their academic boundaries into the dynamic and ever-expanding field of bioinformatics

    Data Mining

    Get PDF
    The availability of big data due to computerization and automation has generated an urgent need for new techniques to analyze and convert big data into useful information and knowledge. Data mining is a promising and leading-edge technology for mining large volumes of data, looking for hidden information, and aiding knowledge discovery. It can be used for characterization, classification, discrimination, anomaly detection, association, clustering, trend or evolution prediction, and much more in fields such as science, medicine, economics, engineering, computers, and even business analytics. This book presents basic concepts, ideas, and research in data mining

    Predicting breast cancer risk, recurrence and survivability

    Full text link
    This thesis focuses on predicting breast cancer at early stages by using machine learning algorithms based on biological datasets. The accuracy of those algorithms has been improved to enable the physicians to enhance the success of treatment, thus saving lives and avoiding several further medical tests

    Generalised fourier analysis of human chromosome images

    Get PDF

    Improvement of EEG based brain computer interface by application of tripolar electrodes and independent component analysis

    Get PDF
    For persons with severe disabilities, a brain computer interface (BCI) may be a viable means of communication, with scalp-recorded electroencephalogram (EEG) being the most common signal employed in the operation of a BCI. Various electrode configurations can be used for EEG recording, one of which was a set of concentric rings that was referred to as a Laplacian electrode. It has been shown that Lapalacian EEG could improve classification in EEG recognition, but the complete advantages of this configuration have not been established. This project included two parts. First, a modeling study was performed using Independent Component Analysis (ICA) to prove that tripolar electrodes could provide better EEG signal for BCI. Next, human experiments were performed to study the application of tripolar electrodes in a BCI model to show that the application of tripolar electrodes and data-segment related parameter selection can improve EEG classification ratio for BCI. In the first part of work, an improved four-layer anisotropic concentric spherical head computer model was programmed, then four configurations of time-varying dipole signals were used to generate the scalp surface signals that would be obtained with tripolar and disc electrodes. Four important EEG artifacts were tested: eye blinking, cheek movements, jaw movements and talking. Finally, a fast fixed-point algorithm was used for signal-independent component analysis (ICA). The results showed that signals from tripolar electrodes generated better ICA separation than signals from disc electrodes for EEG signals, suggesting that tripolar electrodes could provide better EEG signal for BCI. The human experiments were divided into three parts: improvement of the data acquirement system by application of tripolar concentric electrodes and related circuit; development of pre-feature selection algorithm to improve BCI EEG signal classification; and an autoregressive (AR) model and Mahalanobis distance-based linear classifier for BCI classification. In the work, tripolar electrodes and corresponding data acquisition system were developed. Two sets of left/right hand motor imagery EEG signals were acquired. Then the effectiveness of signals from tripolar concentric electrodes and disc electrodes were compared for use as a BCI. The pre-feature selection methods were developed and applied to four data segment-related parameters: the length of the data segment in each trial (LDS), its starting position (SPD), the number of trials (NT) and the AR model order (AR Order). The study showed that, compared to the classification ratio (CR) without parameter selection, the CR was significantly different with an increase by 20% to 30% with proper selection of these data-segment-related parameter values and that the optimum parameter values were subject-dependent, which suggests that the data-segment-related parameters should be individualized when building models for BCI. The experiments also showed that that tripolar concentric electrodes generated significantly higher classification accuracy than disc electrodes
    corecore