5 research outputs found

    Seasonal to Inter-annual Climate Prediction Using Data Mining KNN TYechnique”,

    Get PDF
    Abstract. The impact of seasonal to inter-annual climate prediction on society, business, agriculture and almost all aspects of human life, force the scientist to give proper attention to the matter. The last few years show tremendous achievements in this field. All systems and techniques developed so far, use the Sea Surface Temperature (SST) as the main factor, among other seasonal climatic attributes. Statistical and mathematical models are then used for further climate predictions. In this paper, we develop a system that uses the historical weather data of a region (rain, wind speed, dew point, temperature, etc.), and apply the data-mining algorithm "K-Nearest Neighbor (KNN)" for classification of these historical data into a specific time span. The k nearest time spans (k nearest neighbors) are then taken to predict the weather. Our experiments show that the system generates accurate results within reasonable time for months in advance

    Data Mining Techniques for Weather Prediction: A Review

    Get PDF
    Data mining is the computer assisted process of digging through and analysing enormous sets of data and then extracting the meaningful data. Data mining tools predicts behaviours and future trends, allowing businesses to make proactive decisions. It can answer questions that traditionally were very time consuming to resolve. Therefore they can be used to predict meteorological data that is weather prediction. Weather prediction is a vital application in meteorology and has been one of the most scientifically and technologically challenging problems across the world in the last century. Predicting the weather is essential to help preparing for the best and the worst of the climate. Accurate Weather Prediction has been one of the most challenging problems around the world. Many weather predictions like rainfall prediction, thunderstorm prediction, predicting cloud conditions are major challenges for atmospheric research. This paper presents the review of Data Mining Techniques for Weather Prediction and studies the benefit of using it. The paper provides a survey of available literatures of some algorithms employed by different researchers to utilize various data mining techniques, for Weather Prediction. The work that has been done by various researchers in this field has been reviewed and compared in a tabular form. For weather prediction, decision tree and k-mean clustering proves to be good with higher prediction accuracy than other techniques of data mining

    Penerapan Rough Set Dan Fuzzy Rough Set Untuk Klasifikasi Data Tidak Lengkap

    Get PDF
    Data mining merupakan salah satu proses untuk menemukan pola dan pengetahuan dari database. Sebagian besar database di dunia nyata tidak dapat dihindari dari masalah ketidaklengkapan. Hal ini disebabkan antara lain oleh kesalahan prosedur manual entri data, pengukuran yang salah, dan kesalahan peralatan. Salah satu database yang tidak terlepas dari masalah ketidaklengkapan adalah dataset meteorologi, sehingga diperlukan algoritma klasifikasi yang mampu menangani nilai atribut yang tidak lengkap dalam data meteorologi. Dataset meteorologi yang digunakan terdiri dari atribut temperatur, kelembaban, tekanan udara, kecepatan angin, dan curah hujan. Pada penelitian ini, penanganan data yang tidak lengkap menggunakan algoritma klasifikasi berbasis rough set dan fuzzy rough set. Hasil yang diperoleh berupa rules untuk mengklasifikasikan data meteorologi tidak lengkap pada data uji. Hasil pengujian algoritma rough set dan fuzzy rough set pada dataset yang memuat 5%, 10%, 15%, 20%, 25%, dan 30% missing value menunjukkan bahwa: (i) akurasi rules berbasis algoritma rough set mengalami penurunan ketika persentase missing value bertambah, sedangkan akurasi rules berbasis algoritma fuzzy rough set mengalami peningkatan ketika persentase missing value ditingkatkan sampai 25% dan akurasi mengalami penurunan ketika persentase missing value bertambah menjadi 30%, (ii) peningkatan persentase missing value mempengaruhi jumlah rules dan waktu komputasi pembentukan rules berbasis algoritma rough set, tetapi tidak berpengaruh pada jumlah rules dan waktu komputasi pembentukan rules berbasis algoritma fuzzy rough set, dan (iii) pada penerapan rules terhadap data uji, terdapat data uji yang tidak dapat diprediksi oleh rules berbasis algoritma rough set, tetapi dapat diprediksi oleh rules berbasis algoritma fuzzy rough set. ========== Data mining is a process of finding patterns and knowledge of the database. Most of the databases in the real world can not be avoided from the problem of incompleteness. This is caused partly by faulty procedure manual data entry, wrong measurements, and equipment faults. One of database that can not be separated from the problem of incompleteness is the meteorological dataset, so that required classification algorithm that capable of handling incomplete attribute values in meteorological data. Meteorological dataset is used consist of the average temperature, humidity, air pressure, wind of speed, and rainfall. In this study, the handling of incomplete data use classification algorithm based on rough sets and fuzzy rough sets. The results obtained in the form of rules for classifying the incomplete meteorological data on test data. Results of the testing rough set and fuzzy rough set algorithm on meteorological dataset containing 5%, 10%, 15%, 20%, 25%, and 30% missing value showed that: (i) the accuracy of the rules based rough set algorithm decreased when the percentage of missing value increases, while the accuracy of the rules based fuzzy rough set algorithm increased when the percentage of missing value increased to 25% and accuracy decreased when the percentage of missing value increased to 30%, (ii) an increase in the percentage of missing value affects the number of rules and computing time of forming rules based rough sets algorithm, but it had no effect on the number of rules and computing time of forming rules based fuzzy rough sets algorithms, and (iii) application of rules for the test data, there is a test data that can not be predicted by the rules based rough sets algorithm, but can be predicted by rules based fuzzy rough sets algorithms
    corecore