4,643 research outputs found
Taming Wild High Dimensional Text Data with a Fuzzy Lash
The bag of words (BOW) represents a corpus in a matrix whose elements are the
frequency of words. However, each row in the matrix is a very high-dimensional
sparse vector. Dimension reduction (DR) is a popular method to address sparsity
and high-dimensionality issues. Among different strategies to develop DR
method, Unsupervised Feature Transformation (UFT) is a popular strategy to map
all words on a new basis to represent BOW. The recent increase of text data and
its challenges imply that DR area still needs new perspectives. Although a wide
range of methods based on the UFT strategy has been developed, the fuzzy
approach has not been considered for DR based on this strategy. This research
investigates the application of fuzzy clustering as a DR method based on the
UFT strategy to collapse BOW matrix to provide a lower-dimensional
representation of documents instead of the words in a corpus. The quantitative
evaluation shows that fuzzy clustering produces superior performance and
features to Principal Components Analysis (PCA) and Singular Value
Decomposition (SVD), two popular DR methods based on the UFT strategy
Nonlinear Dimensionality Reduction for Data Visualization: An Unsupervised Fuzzy Rule-based Approach
Here, we propose an unsupervised fuzzy rule-based dimensionality reduction
method primarily for data visualization. It considers the following important
issues relevant to dimensionality reduction-based data visualization: (i)
preservation of neighborhood relationships, (ii) handling data on a non-linear
manifold, (iii) the capability of predicting projections for new test data
points, (iv) interpretability of the system, and (v) the ability to reject test
points if required. For this, we use a first-order Takagi-Sugeno type model. We
generate rule antecedents using clusters in the input data. In this context, we
also propose a new variant of the Geodesic c-means clustering algorithm. We
estimate the rule parameters by minimizing an error function that preserves the
inter-point geodesic distances (distances over the manifold) as Euclidean
distances on the projected space. We apply the proposed method on three
synthetic and three real-world data sets and visually compare the results with
four other standard data visualization methods. The obtained results show that
the proposed method behaves desirably and performs better than or comparable to
the methods compared with. The proposed method is found to be robust to the
initial conditions. The predictability of the proposed method for test points
is validated by experiments. We also assess the ability of our method to reject
output points when it should. Then, we extend this concept to provide a general
framework for learning an unsupervised fuzzy model for data projection with
different objective functions. To the best of our knowledge, this is the first
attempt to manifold learning using unsupervised fuzzy modeling
Application of biosignal-driven intelligent systems for multifunction prosthesis control
University of Technology, Sydney. Faculty of Engineering and Information Technology.Prosthetic devices aim to provide an artificial alternative to missing limbs. The controller for such devices is usually driven by the biosignals generated by the human body, particularly Electromyogram (EMG) or Electroencephalogram (EEG) signals. Such a controller utilizes a pattern recognition approach to classify the EMG signal recorded from the human muscles or the EEG signal from the brain. The aim of this thesis is to improve the EMG and EEG pattern classification accuracy. Due to the fact that the success of pattern recognition based biosignal driven systems highly depends on the quality of extracted features, a number of novel, robust, hybrid and innovative methods are proposed to achieve better performance. These methods are developed to effectively tackle many of the limitations of existing systems, in particular feature representation and dimensionality reduction. A set of knowledge extraction methods that can accurately and rapidly identify the most important attributes for classifying the arm movements are formulated. This is accomplished through the following:
1. Developing a new feature extraction technique that can identify the most important features from the high-dimensional time-frequency representation of the multichannel EMG and EEG signals. For this task, an information content estimation method using fuzzy entropies and fuzzy mutual information is proposed to identify the optimal wravelet packet transform decomposition for classification.
2. Developing a powerful variable (feature or channel) selection paradigm to improve the performance of multi-channel EMG and EEG driven systems. This will eventually lead to the development of a combined channel and feature selection technique as one possible scheme for dimensionality reduction. Two novel feature selection methods are developed under this scheme utilizing the ant colony arid differential evolution optimization techniques. The differential evolution optimization technique is further modified in a novel attempt in employing a float optimizer for the combinatorial task of feature selection, proving powerful performance by both methods.
3. Developing two feature projection techniques that extract a small subset of highly informative discriminant features, thus acting as an alternative scheme for dimensionality reduction. The two methods represent novel variations to fuzzy discriminant analysis based projection techniques. In addition, an extension to the non-linear discriminant analysis is proposed based on a mixture of differential evolution and fuzzy discriminant analysis.
The testing and verification process of the proposed methods on different EMG and EEG datasets provides very encouraging results
Energy performance forecasting of residential buildings using fuzzy approaches
The energy consumption used for domestic purposes in Europe is, to a considerable extent, due to heating and cooling. This energy is produced mostly by burning fossil fuels, which has a high negative environmental impact. The characteristics of a building are an important factor to determine the necessities of heating and cooling loads. Therefore, the study of the relevant characteristics of the buildings, regarding the heating and cooling needed to maintain comfortable indoor air conditions, could be very useful in order to design and construct energy-efficient buildings. In previous studies, different machine-learning approaches have been used to predict heating and cooling loads from the set of variables: relative compactness, surface area, wall area, roof area, overall height, orientation, glazing area and glazing area distribution. However, none of these methods are based on fuzzy logic. In this research, we study two fuzzy logic approaches, i.e., fuzzy inductive reasoning (FIR) and adaptive neuro fuzzy inference system (ANFIS), to deal with the same problem. Fuzzy approaches obtain very good results, outperforming all the methods described in previous studies except one. In this work, we also study the feature selection process of FIR methodology as a pre-processing tool to select the more relevant variables before the use of any predictive modelling methodology. It is proven that FIR feature selection provides interesting insights into the main building variables causally related to heating and cooling loads. This allows better decision making and design strategies, since accurate cooling and heating load estimations and correct identification of parameters that affect building energy demands are of high importance to optimize building designs and equipment specifications.Peer ReviewedPostprint (published version
A cell outage management framework for dense heterogeneous networks
In this paper, we present a novel cell outage management (COM) framework for heterogeneous networks with split control and data planes-a candidate architecture for meeting future capacity, quality-of-service, and energy efficiency demands. In such an architecture, the control and data functionalities are not necessarily handled by the same node. The control base stations (BSs) manage the transmission of control information and user equipment (UE) mobility, whereas the data BSs handle UE data. An implication of this split architecture is that an outage to a BS in one plane has to be compensated by other BSs in the same plane. Our COM framework addresses this challenge by incorporating two distinct cell outage detection (COD) algorithms to cope with the idiosyncrasies of both data and control planes. The COD algorithm for control cells leverages the relatively larger number of UEs in the control cell to gather large-scale minimization-of-drive-test report data and detects an outage by applying machine learning and anomaly detection techniques. To improve outage detection accuracy, we also investigate and compare the performance of two anomaly-detecting algorithms, i.e., k-nearest-neighbor- and local-outlier-factor-based anomaly detectors, within the control COD. On the other hand, for data cell COD, we propose a heuristic Grey-prediction-based approach, which can work with the small number of UE in the data cell, by exploiting the fact that the control BS manages UE-data BS connectivity and by receiving a periodic update of the received signal reference power statistic between the UEs and data BSs in its coverage. The detection accuracy of the heuristic data COD algorithm is further improved by exploiting the Fourier series of the residual error that is inherent to a Grey prediction model. Our COM framework integrates these two COD algorithms with a cell outage compensation (COC) algorithm that can be applied to both planes. Our COC solution utilizes an actor-critic-based reinforcement learning algorithm, which optimizes the capacity and coverage of the identified outage zone in a plane, by adjusting the antenna gain and transmission power of the surrounding BSs in that plane. The simulation results show that the proposed framework can detect both data and control cell outage and compensate for the detected outage in a reliable manner
Mathematics at the eve of a historic transition in biology
A century ago physicists and mathematicians worked in tandem and established
quantum mechanism. Indeed, algebras, partial differential equations, group
theory, and functional analysis underpin the foundation of quantum mechanism.
Currently, biology is undergoing a historic transition from qualitative,
phenomenological and descriptive to quantitative, analytical and predictive.
Mathematics, again, becomes a driving force behind this new transition in
biology.Comment: 5 pages, 2 figure
- …