31,739 research outputs found
General fuzzy min-max neural network for clustering and classification
This paper describes a general fuzzy min-max (GFMM) neural network which is a generalization and extension of the fuzzy min-max clustering and classification algorithms of Simpson (1992, 1993). The GFMM method combines supervised and unsupervised learning in a single training algorithm. The fusion of clustering and classification resulted in an algorithm that can be used as pure clustering, pure classification, or hybrid clustering classification. It exhibits a property of finding decision boundaries between classes while clustering patterns that cannot be said to belong to any of existing classes. Similarly to the original algorithms, the hyperbox fuzzy sets are used as a representation of clusters and classes. Learning is usually completed in a few passes and consists of placing and adjusting the hyperboxes in the pattern space; this is an expansion-contraction process. The classification results can be crisp or fuzzy. New data can be included without the need for retraining. While retaining all the interesting features of the original algorithms, a number of modifications to their definition have been made in order to accommodate fuzzy input patterns in the form of lower and upper bounds, combine the supervised and unsupervised learning, and improve the effectiveness of operations. A detailed account of the GFMM neural network, its comparison with the Simpson's fuzzy min-max neural networks, a set of examples, and an application to the leakage detection and identification in water distribution systems are given
Semiblind Channel Estimation and Data Detection for OFDM Systems With Optimal Pilot Design
This paper considers semiblind channel estimation and data detection for orthogonal frequency-division multiplexing (OFDM) over frequency-selective fading channels. We show that the samples of an OFDM symbol are jointly complex Gaussian distributed, where the mean and covariance are determined by the locations and values of fixed pilot symbols. We exploit this distribution to derive a novel maximum-likelihood (ML) semiblind gradient-descent channel estimator. By exploiting the channel impulse response (CIR) statistics, we also derive a semiblind data detector for both Rayleigh and Ricean fading channels. Furthermore, we develop an enhanced data detector, which uses the estimator error statistics to mitigate the effect of channel estimation errors. Efficient implementation of both the semiblind and the improved data detectors is provided via sphere decoding and nulling-canceling detection. We also derive the Cramér-Rao bound (CRB) and design optimal pilots by minimizing the CRB. Our proposed channel estimator and data detector exhibit high bandwidth efficiency (requiring only a few pilot symbols), achieve the CRB, and also nearly reach the performance of an ideal reference receiver
A transferable machine-learning framework linking interstice distribution and plastic heterogeneity in metallic glasses
When metallic glasses (MGs) are subjected to mechanical loads, the plastic
response of atoms is non-uniform. However, the extent and manner in which
atomic environment signatures present in the undeformed structure determine
this plastic heterogeneity remain elusive. Here, we demonstrate that novel site
environment features that characterize interstice distributions around atoms
combined with machine learning (ML) can reliably identify plastic sites in
several Cu-Zr compositions. Using only quenched structural information as
input, the ML-based plastic probability estimates ("quench-in softness" metric)
can identify plastic sites that could activate at high strains, losing
predictive power only upon the formation of shear bands. Moreover, we reveal
that a quench-in softness model trained on a single composition and quenching
rate substantially improves upon previous models in generalizing to different
compositions and completely different MG systems (Ni62Nb38, Al90Sm10 and
Fe80P20). Our work presents a general, data-centric framework that could
potentially be used to address the structural origin of any site-specific
property in MGs
An improved random forest model of short-term wind-power forecasting to enhance accuracy, efficiency, and robustness
Short‐term wind‐power forecasting methods like neural networks are trained by empirical risk minimization. The local optimum and overfitting problem is likely to occur in the model‐training stage, leading to the poor ability of reasoning and generalization in the prediction stage. To solve the problem, a model of short‐term wind power forecasting is proposed based on 2‐stage feature selection and a supervised random forest in the paper. First, in data preprocessing, some redundant features can be removed by a variable importance measure method and intimate samples can be selected based on relevant analysis, so that the efficiency of model training and the correlation degree between input and output samples can be enhanced. Second, an improved supervised random forest (RF) methodology is proposed to compose a new RF based on evaluating the performance of each decision tree and restructuring the decision trees. A new index of external validation in correlation with wind speed in numerical weather prediction has been proposed to overcome the shortcomings of the internal validation index that seriously depends on the training samples. The simulation examples have verified the rationality and feasibility of the improvement. Case studies of measured data from a wind farm have shown that the proposed model has a better performance than the original RF, back propagation neural network, Bayesian network, and support vector machine, in aspects of ensuring accuracy, efficiency, and robustness, and especially if there is high rate of noisy data and wind power curtailment duration in the historical data
- …