4 research outputs found

    Swarm optimized organizing map (SWOM): A swarm intelligence basedoptimization of self-organizing map

    No full text
    This work studies the optimization of SOM algorithm in terms of reducing its training time by the use of a swarm intelligence method, i.e. particle swarm optimization (PSO). Our novel algorithm optimizes SOM with PSO and reduces computational time of the training phase of SOM significantly. The performance of the algorithms has been tested with genomic datasets, biomedical datasets and an artificial dataset to show the efficiency of swarm optimized SOM, i.e. SWOM. The experimental comparison between SOM and SWOM, has demonstrated significant reduction in training time of SWOM with preservation of clustering quality. © 2009 Elsevier Ltd. All rights reserved

    A New Approach for Prediction of Solar Radiation with Using Ensemble Learning Algorithm

    No full text
    This article investigates the competence of ensemble learning techniques in solar irradiance prediction. It was seen from the literature survey, an ensemble tree model, random forests is studied more frequently as ensemble models. However, ensemble of support vector regression (SVR) and artificial neural networks (ANN) is also possible. So, this study is the first detailed evaluation of ensemble models in solar irradiance estimation domain. Boosting and bagging ensembles of SVR, ANN and decision tree (DT), are developed to estimate solar irradiance in hourly basis in five cities in Turkey. First frequently used base models (SVR, ANN, and DT) are created and tested with the use of 5 years meteorological data. Then boosting and bagging ensembles of the base models are developed and tested with the same data. The base models are compared with their ensemble counterparts in terms of average coefficient of determination (R2) and root mean squared error (RMSE). The comparative results show that boosting and bagging ensemble models improve SVR, ANN, and DT in terms of RMSE between 4.6 and 14.6% in average. The results show empirically that ensemble models improve prediction accuracies of various base regression models and it can be applied to other machine learning models used in solar irradiance prediction. © 2019, King Fahd University of Petroleum & Minerals

    TTC-3600: A new benchmark dataset for Turkish text categorization

    No full text
    Owing to the rapid growth of the World Wide Web, the number of documents that can be accessed via the Internet explosively increases with each passing day. Considering news portals in particular, sometimes documents related to categories such as technology, sports and politics seem to be in the wrong category or documents are located in a generic category called others. At this point, text categorization (TC), which is generally addressed as a supervised learning task is needed. Although there are substantial number of studies conducted on TC in other languages, the number of studies conducted in Turkish is very limited owing to the lack of accessibility and usability of datasets created. In this paper, a new dataset named TTC-3600, which can be widely used in studies of TC of Turkish news and articles, is created. TTC-3600 is a well-documented dataset and its file formats are compatible with well-known text mining tools. Five widely used classifiers within the field of TC and two feature selection methods are evaluated on TTC-3600. The experimental results indicate that the best accuracy criterion value 91.03% is obtained with the combination of Random Forest classifier and attribute ranking-based feature selection method in all comparisons performed after pre-processing and feature selection steps. The publicly available TTC-3600 dataset and the experimental results of this study can be utilized in comparative experiments by other researchers. © Chartered Institute of Library and Information Professionals
    corecore