67,679 research outputs found
Research on Precipitation Prediction Model Based on Extreme Learning Machine Ensemble
Precipitation is a significant index to measure the degree of drought and flood in a region, which directly reflects the local natural changes and ecological environment. It is very important to grasp the change characteristics and law of precipitation accurately for effectively reducing disaster loss and maintaining the stable development of a social economy. In order to accurately predict precipitation, a new precipitation prediction model based on extreme learning machine ensemble (ELME) is proposed. The integrated model is based on the extreme learning machine (ELM) with different kernel functions and supporting parameters, and the submodel with the minimum root mean square error (RMSE) is found to fit the test data. Due to the complex mechanism and factors affecting precipitation change, the data have strong uncertainty and significant nonlinear variation characteristics. The mean generating function (MGF) is used to generate the continuation factor matrix, and the principal component analysis technique is employed to reduce the dimension of the continuation matrix, and the effective data features are extracted. Finally, the ELME prediction model is established by using the precipitation data of Liuzhou city from 1951 to 2021 in June, July and August, and a comparative experiment is carried out by using ELM, long-term and short-term memory neural network (LSTM) and back propagation neural network based on genetic algorithm (GA-BP). The experimental results show that the prediction accuracy of the proposed method is significantly higher than that of other models, and it has high stability and reliability, which provides a reliable method for precipitation prediction
MEG Decoding Across Subjects
Brain decoding is a data analysis paradigm for neuroimaging experiments that
is based on predicting the stimulus presented to the subject from the
concurrent brain activity. In order to make inference at the group level, a
straightforward but sometimes unsuccessful approach is to train a classifier on
the trials of a group of subjects and then to test it on unseen trials from new
subjects. The extreme difficulty is related to the structural and functional
variability across the subjects. We call this approach "decoding across
subjects". In this work, we address the problem of decoding across subjects for
magnetoencephalographic (MEG) experiments and we provide the following
contributions: first, we formally describe the problem and show that it belongs
to a machine learning sub-field called transductive transfer learning (TTL).
Second, we propose to use a simple TTL technique that accounts for the
differences between train data and test data. Third, we propose the use of
ensemble learning, and specifically of stacked generalization, to address the
variability across subjects within train data, with the aim of producing more
stable classifiers. On a face vs. scramble task MEG dataset of 16 subjects, we
compare the standard approach of not modelling the differences across subjects,
to the proposed one of combining TTL and ensemble learning. We show that the
proposed approach is consistently more accurate than the standard one
Comparative Analysis of Machine Learning Algorithms for Solar Irradiance Forecasting in Smart Grids
The increasing global demand for clean and environmentally friendly energy
resources has caused increased interest in harnessing solar power through
photovoltaic (PV) systems for smart grids and homes. However, the inherent
unpredictability of PV generation poses problems associated with smart grid
planning and management, energy trading and market participation, demand
response, reliability, etc. Therefore, solar irradiance forecasting is
essential for optimizing PV system utilization. This study proposes the
next-generation machine learning algorithms such as random forests, Extreme
Gradient Boosting (XGBoost), Light Gradient Boosted Machine (lightGBM)
ensemble, CatBoost, and Multilayer Perceptron Artificial Neural Networks
(MLP-ANNs) to forecast solar irradiance. Besides, Bayesian optimization is
applied to hyperparameter tuning. Unlike tree-based ensemble algorithms that
select the features intrinsically, MLP-ANN needs feature selection as a
separate step. The simulation results indicate that the performance of the
MLP-ANNs improves when feature selection is applied. Besides, the random forest
outperforms the other learning algorithms.Comment: 6 pages, 4 figures, 3 tables, to appear in the 13th Smart Grid
Conferenc
Ensemble Feature Learning-Based Event Classification for Cyber-Physical Security of the Smart Grid
The power grids are transforming into the cyber-physical smart grid with increasing two-way communications and abundant data flows. Despite the efficiency and reliability promised by this transformation, the growing threats and incidences of cyber attacks targeting the physical power systems have exposed severe vulnerabilities. To tackle such vulnerabilities, intrusion detection systems (IDS) are proposed to monitor threats for the cyber-physical security of electrical power and energy systems in the smart grid with increasing machine-to-machine communication. However, the multi-sourced, correlated, and often noise-contained data, which record various concurring cyber and physical events, are posing significant challenges to the accurate distinction by IDS among events of inadvertent and malignant natures. Hence, in this research, an ensemble learning-based feature learning and classification for cyber-physical smart grid are designed and implemented. The contribution of this research are (i) the design, implementation and evaluation of an ensemble learning-based attack classifier using extreme gradient boosting (XGBoost) to effectively detect and identify attack threats from the heterogeneous cyber-physical information in the smart grid; (ii) the design, implementation and evaluation of stacked denoising autoencoder (SDAE) to extract highlyrepresentative feature space that allow reconstruction of a noise-free input from noise-corrupted
perturbations; (iii) the design, implementation and evaluation of a novel ensemble learning-based feature extractors that combine multiple autoencoder (AE) feature extractors and random forest base classifiers, so as to enable accurate reconstruction of each feature and reliable classification against malicious events. The simulation results validate the usefulness of ensemble learning approach in detecting malicious events in the cyber-physical smart grid
XGBOD: Improving Supervised Outlier Detection with Unsupervised Representation Learning
A new semi-supervised ensemble algorithm called XGBOD (Extreme Gradient
Boosting Outlier Detection) is proposed, described and demonstrated for the
enhanced detection of outliers from normal observations in various practical
datasets. The proposed framework combines the strengths of both supervised and
unsupervised machine learning methods by creating a hybrid approach that
exploits each of their individual performance capabilities in outlier
detection. XGBOD uses multiple unsupervised outlier mining algorithms to
extract useful representations from the underlying data that augment the
predictive capabilities of an embedded supervised classifier on an improved
feature space. The novel approach is shown to provide superior performance in
comparison to competing individual detectors, the full ensemble and two
existing representation learning based algorithms across seven outlier
datasets.Comment: Proceedings of the 2018 International Joint Conference on Neural
Networks (IJCNN
Bioactive molecule prediction using extreme gradient boosting
Following the explosive growth in chemical and biological data, the shift from traditional methods of drug discovery to computer-aided means has made data mining and machine learning methods integral parts of today's drug discovery process. In this paper, extreme gradient boosting (Xgboost), which is an ensemble of Classification and Regression Tree (CART) and a variant of the Gradient Boosting Machine, was investigated for the prediction of biological activity based on quantitative description of the compound's molecular structure. Seven datasets, well known in the literature were used in this paper and experimental results show that Xgboost can outperform machine learning algorithms like Random Forest (RF), Support Vector Machines (LSVM), Radial Basis Function Neural Network (RBFN) and Naïve Bayes (NB) for the prediction of biological activities. In addition to its ability to detect minority activity classes in highly imbalanced datasets, it showed remarkable performance on both high and low diversity datasets
- …