
On the role of pre and post-processing in environmental data mining

Author: Athanasiadis Ioannis
Comas Joaquim
Gibert Karina
Holmes Geoffrey
Izquierdo Joaquin
Sanchez-Marre Miquel
Publication venue: International Environmental Modelling and Software Society
Publication date: 01/01/2008

The quality of discovered knowledge depends heavily on data quality. Unfortunately, real data tend to contain noise, uncertainty, errors, redundancies or even irrelevant information. The more complex the reality being analyzed, the higher the risk of obtaining low-quality data. Knowledge Discovery from Databases (KDD) offers a global framework for preparing data in the right form to perform correct analyses. On the other hand, the quality of decisions based on KDD results depends not only on the quality of the results themselves, but also on the capacity of the system to communicate those results in an understandable form. Environmental systems are particularly complex, and environmental users particularly require clarity in their results. This paper provides some details on how this can be achieved and discusses the role of pre- and post-processing in the whole process of Knowledge Discovery in environmental systems.

Research Commons@Waikato

Sustainable and efficient energy consumption of corn production in Southwest Iran: combination of multi-fuzzy and DEA modeling

Author: Almassi Morteza
Azadi Hossein
Davoodi Mohammad Javad Sheikh
Houshyar Ehsan
Witlox Frank
Publication venue: Elsevier BV
Publication date: 01/01/2012

Ghent University Academic Bibliography

Institutional Repository Universiteit Antwerpen

Unsupervised Machine Learning and Data Mining Procedures Reveal Short Term, Climate Driven Patterns Linking Physico-Chemical Features and Zooplankton Diversity in Small Ponds

Author: Catia Maurone
Erica Racchetti
Marco Bartoli
Nicolò Bellin
Valeria Rossi
Publication venue: MDPI AG
Publication date: 01/01/2021

Machine Learning (ML) is an increasingly accessible discipline in computer science that develops dynamic algorithms capable of data-driven decisions, and its use in ecology is growing. Compared to other standard algorithms, fuzzy sets are suitable descriptors of ecological communities and allow the description of decisions that include elements of uncertainty and vagueness. However, fuzzy sets are scarcely applied in ecology. In this work, an unsupervised machine learning algorithm (fuzzy c-means) and association rule mining were applied to assess the factors influencing the assemblage composition and distribution patterns of 12 zooplankton taxa in 24 shallow ponds in northern Italy. The fuzzy c-means algorithm was implemented to classify the ponds in terms of the taxa they support, and to identify the influence of chemical and physical environmental features on the assemblage patterns. Data retrieved during 2014 and 2015 were compared, taking into account that 2014 late spring and summer air temperatures were much lower than historical records, whereas 2015 mean monthly air temperatures were much warmer than historical averages. In both years, fuzzy c-means shows a strong clustering of ponds in two groups, contrasting sites characterized by different physico-chemical and biological features. Climatic anomalies affecting the temperature regime, together with the main water supply to shallow ponds (e.g., surface runoff vs. groundwater), represent disturbance factors producing large interannual differences in the chemistry, biology and short-term dynamics of small aquatic ecosystems. Unsupervised machine learning algorithms and fuzzy sets may help in capturing such apparently erratic differences.
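For readers unfamiliar with fuzzy c-means, the sketch below shows the general technique in plain Python: each item receives a graded membership in every cluster instead of a single hard label. It is a minimal illustration and not the authors' implementation. The function name `fuzzy_c_means`, its parameters, and the sample values in `ponds` are all hypothetical.

```python
import random


def fuzzy_c_means(points, n_clusters=2, m=2.0, n_iter=100, tol=1e-6, seed=0):
    """Minimal fuzzy c-means. Returns (centers, memberships).

    m is the fuzzifier and must be greater than 1; memberships[i][k] is the
    degree to which point i belongs to cluster k (each row sums to 1).
    """
    rng = random.Random(seed)
    n, dim = len(points), len(points[0])

    # Start from random memberships, normalised so each row sums to 1.
    u = []
    for _ in range(n):
        row = [rng.random() + 1e-9 for _ in range(n_clusters)]
        total = sum(row)
        u.append([v / total for v in row])

    centers = []
    for _ in range(n_iter):
        # Centers are means of the points weighted by membership ** m.
        centers = []
        for k in range(n_clusters):
            w = [u[i][k] ** m for i in range(n)]
            w_sum = sum(w)
            centers.append(
                [sum(w[i] * points[i][d] for i in range(n)) / w_sum
                 for d in range(dim)]
            )

        # Memberships are recomputed from relative distances to the centers.
        max_change = 0.0
        for i in range(n):
            dists = [
                max(sum((points[i][d] - c[d]) ** 2 for d in range(dim)) ** 0.5,
                    1e-12)  # guard against division by zero
                for c in centers
            ]
            new_row = [
                1.0 / sum((dists[k] / dists[j]) ** (2.0 / (m - 1.0))
                          for j in range(n_clusters))
                for k in range(n_clusters)
            ]
            max_change = max(max_change,
                             max(abs(a - b) for a, b in zip(new_row, u[i])))
            u[i] = new_row

        if max_change < tol:  # stop once memberships are stable
            break
    return centers, u


# Hypothetical two-feature measurements for six sites (illustrative only).
ponds = [(0.0, 0.0), (0.1, 0.2), (0.2, 0.1), (5.0, 5.0), (5.1, 4.9), (4.9, 5.2)]
centers, memberships = fuzzy_c_means(ponds, n_clusters=2)
for site, row in zip(ponds, memberships):
    print(site, [round(v, 3) for v in row])
```

The graded memberships are what distinguish this from hard clustering: a site lying between two groups would receive intermediate values rather than being forced into one group.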

Archivio istituzionale della Ricerca - Università degli Studi di Parma

FSL-BM: Fuzzy Supervised Learning with Binary Meta-Feature for Classification

Author: CP Chen
G Qin
J Cargile
J West
JA Evans
JC Bezdek
K Kowsari
L Bahl
M Russo
MJ Prabu
R Vilalta
R Wieland
RAR Ashfaq
S-S Choi
X Jiang
X Qiu
Publication venue: Springer Science and Business Media LLC
Publication date: 15/11/2017

This paper introduces a novel real-time Fuzzy Supervised Learning with Binary Meta-Feature (FSL-BM) algorithm for big data classification tasks. The study of real-time algorithms addresses several major concerns, namely accuracy, memory consumption, the ability to stretch assumptions, and time complexity. Attaining a fast computational model that provides fuzzy logic and supervised learning is one of the main challenges in machine learning. In this research paper, we present the FSL-BM algorithm as an efficient solution for supervised learning with fuzzy logic processing, using a binary meta-feature representation with Hamming Distance and a Hash function to relax assumptions. While many studies during the last decade focused on reducing time complexity and increasing accuracy, the novel contribution of the proposed solution comes through the integration of Hamming Distance, a Hash function, binary meta-features, and binary classification to provide a real-time supervised method. The Hash Table (HT) component gives fast access to existing indices and therefore allows the generation of new indices in constant time complexity, which supersedes existing fuzzy supervised algorithms with better or comparable results. To summarize, the main contribution of this technique for real-time Fuzzy Supervised Learning is to represent hypotheses through binary input as a meta-feature space and to create the Fuzzy Supervised Hash table to train and validate the model. Comment: FICC201
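The building blocks named in this abstract (binary feature codes, a hash table, and Hamming distance) can be illustrated with a small Python sketch. This is not the FSL-BM algorithm itself, whose details the abstract does not give. The thresholding scheme, the use of label proportions as graded memberships, and the names `to_bits`, `hamming`, `train` and `predict` are all assumptions made for illustration.

```python
from collections import Counter, defaultdict


def to_bits(features, thresholds):
    """Encode a numeric vector as an integer bit pattern, one bit per feature.

    Assumption: a feature maps to 1 when it reaches its threshold, else 0.
    """
    code = 0
    for value, threshold in zip(features, thresholds):
        code = (code << 1) | (1 if value >= threshold else 0)
    return code


def hamming(a, b):
    """Number of differing bits between two integer bit patterns."""
    return bin(a ^ b).count("1")


def train(samples, labels, thresholds):
    """Build a hash table mapping each bit pattern to its label counts."""
    table = defaultdict(Counter)
    for features, label in zip(samples, labels):
        table[to_bits(features, thresholds)][label] += 1
    return table


def predict(table, features, thresholds):
    """Return label proportions for the matching (or nearest) bit pattern."""
    code = to_bits(features, thresholds)
    if code not in table:
        # Fallback: linear scan for the stored pattern closest in Hamming
        # distance. Only the exact-match lookup above is constant time.
        code = min(table, key=lambda known: hamming(known, code))
    counts = table[code]
    total = sum(counts.values())
    return {label: n / total for label, n in counts.items()}


# Hypothetical training data (illustrative values only).
thresholds = [0.5, 0.5, 0.5]
samples = [(0.9, 0.8, 0.1), (0.7, 0.9, 0.2), (0.9, 0.9, 0.3), (0.1, 0.2, 0.9)]
labels = ["a", "a", "b", "b"]
table = train(samples, labels, thresholds)
print(predict(table, (0.8, 0.6, 0.0), thresholds))
print(predict(table, (0.1, 0.1, 0.1), thresholds))
```

The dictionary lookup gives constant-time access when the exact pattern has been seen before; the Hamming-distance fallback in this sketch scans all stored patterns and is therefore linear in the table size.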

arXiv.org e-Print Archive

Crossref

Comparison of three modelling approaches of potential natural forest habitats in Bavaria, Germany

Author: Förster Michael
Kleinschmit Birgit
Walentowski Helge
Publication date: 01/01/2005

In the context of the EU Habitats Directive, which contains the obligation of environmental monitoring, nature conservation authorities face a growing demand for effective and competitive methods to survey protected habitats. The present study therefore compared three modelling approaches (a rule-based method with applied Bavarian woodland types, the multivariate technique of cluster analysis, and a fuzzy logic approach) for the purpose of detecting potential habitat types. The results can be combined with earth observation data of different geometric resolution (ASTER, SPOT5, aerial photographs or very high resolution satellite data) in order to determine actual forest habitat types. This was carried out at two test sites situated in the pre-alpine area of Bavaria (southern Germany). The results were subsequently compared to the terrestrially mapped habitat areas of the NATURA 2000 management plans. First results show that these techniques are a valuable support in mapping and monitoring NATURA 2000 forest habitats.

Hochschulschriftenserver - Universität Frankfurt am Main