470 research outputs found
Interoperable intelligent environmental decision support systems: a framework proposal
In this paper, an approach for the development of Interoperable Intelligent Environmental Decision Support Systems (IEDSS) is proposed. The framework is based upon the cognitive-oriented approach for the development of IEDSS proposed in (Sànchez-Marrè et al., 2008), where three kind of tasks must be built: analysis tasks, synthesis tasks and prognosis tasks. Now, a fourth level will be
proposed: the model construction layer, which is normally an off-line task. At each level, interoperability should be possible and inter-level interoperability must be als
o achieved. This interoperability is proposed to be obtained using data interchange protocols like Predictive Model Markup Language (PMML), which is a model interc
hange protocol based on XML language, using an ontology of data and AI models to characterize data types and AI models and to set-up a common terminology, and using
workflows of the whole interoperation scheme. In the future, a Multi-Agent System will be used to implement the software components. An example of use of the pro
posed methodology applied to the supervision of a Wastewater Treatment Plant is provided. This Interoperable IEDSS framework will be the first step to an actual interoperability of AI models which will make IEDSS more reliable and accurate to solve complex environmental problems.Peer ReviewedPostprint (published version
Using contextual information in music playlist recommendations
Recommender Systems have become a fundamental part of various applications supporting users when searching for items they could be interested in,at a given moment. However, the majority of Recommender Systems generate isolate item recommendations based mainly on user-item interactions, without taking into account other important information about the recommendation moment, able to deliver users a more complete experience. In this paper, a hybrid Case-based Reasoning model generating recommendations of sets of music items, based on the underlying structures found in previous playlists, is proposed. Furthermore, the described system takes into account the similarity of the basic contextual information of the current and the past recommendation moments. The initial evaluation shows that the proposed approach may deliver recommendations of equal and higher accuracy than some of the widely used techniques.Peer ReviewedPostprint (author's final draft
Dynamic learning of cases from data streams
This paper presents a dynamic adaptive framework for building a case library being able to cope with a data stream in the field of Case-Based Reasoning. The framework provides a three-layer architecture formed by a set of case libraries dynamically built. This Dynamic and Adaptive Case Library (DACL), can process in an incremental way a data stream, and can be used as a classification model or a regression model, depending on the predicted variable. In this paper, the work is focused on classification tasks. Each case library has a first layer formed by the dynamic clusters of cases, a second one formed by the meta-cases or prototypes of the cluster, and a third one formed by an incremental indexing structure. In our approach, some variant of k-d tres have been used, in addition to an exploration technique to get a more efficient retrieval time. This three-layer famework can be constructed in an incremental way. Several meta-case learning approaches are proposed, as well as some case learning strategies. The framework has been tested with several datasets. The experimental results show a very good performance in comparison with a batch learning scheme over the same data.Peer ReviewedPostprint (author's final draft
Data mining as a tool for environmental scientists
Over recent years a huge library of data mining algorithms has been developed to tackle a variety of problems in fields such as medical imaging and network traffic analysis. Many of these techniques are far more flexible than more classical modelling approaches and could be usefully applied to data-rich environmental problems. Certain techniques such as Artificial Neural Networks, Clustering, Case-Based Reasoning and more recently Bayesian Decision Networks have found application in environmental modelling while other methods, for example classification and association rule extraction, have not yet been taken up on any wide scale. We propose that these and other data mining techniques could be usefully applied to difficult problems in the field. This paper introduces several data mining concepts and briefly discusses their application to environmental modelling, where data may be sparse, incomplete, or heterogenous
Using contextual information in music playlist recommendations
Recommender Systems have become a fundamental part of various applications supporting users when searching for items they could be interested in, at a given moment. However, the majority of Recommender Systems generate isolate item recommendations based mainly on user-item interactions, without taking into account other important information about the recommendation moment, able to deliver users a more complete experience. In this paper, a hybrid Case-based Reasoning model generating recommendations of sets of music items, based on the underlying structures found in previous playlists, is proposed. Furthermore, the described system takes into account the similarity of the basic contextual information of the current and the past recommendation moments. The initial evaluation shows that the proposed approach may deliver recommendations of equal and higher accuracy than some of the widely used techniquePeer ReviewedPostprint (author's final draft
Providing intelligent decision support systems with flexible data-intensive case-based reasoning
In this paper we present a flexible CBR shell for Data-Intensive Case-Based Reasoning
Systems which is fully integrated in an Intelligent Data Analysis Tool entitled GESCONDA. The main subgoal of the developed tool is to create a CBR Shell where no fixed domain exists and where letting the expert/user creates (models) his/her own domain. From an abstract point of view,
the definition of the CBR can be seen as a methodology composed by four phases and each phase offers different ways to be solved. Then, since the CBR shell is integrated in GESCONDA, it inherits all its functionalities which cover the whole knowledge discovery and data mining process
and also, CBR can complement its phases with this functionality. As a result, GESCONDA becomes an intelligent decision support tool which encompasses a number of advantages including domain independence, incremental learning, platform independence and generality.Peer ReviewedPostprint (published version
Environmental data stream mining through a case-based stochastic learning approach
© . This manuscript version is made available under the CC-BY-NC-ND 4.0 license http://creativecommons.org/licenses/by-nc-nd/4.0/Environmental data stream mining is an open challenge for Data Science. Common methods used are static because they analyze a static set of data, and provide static data-driven models. Environmental systems are dynamic and generate a continuous data stream. Dynamic methods coping with the temporal nature of data must be provided in Data Science. Our proposal is to model each environmental information unit, timely generated, as a new case/experience in a Case-Based Reasoning (CBR) system. This contribution aims to incrementally build and manage a Dynamic Adaptive Case Library (DACL). In this paper, a stochastic method for the learning of new cases and management of prototypes to create and manage the DACL in an incremental way is introduced. This stochastic method works with two main moments. An evaluation of the method has been carried using a data stream of air quality of the city of Obregon, Sonora. México, with good results. In addition, other datasets have been mined to ensure the generality of the approach.Peer ReviewedPostprint (author's final draft
Analysing similarity assessment in feature-vector case representations
Case-Based Reasoning (CBR) is a good technique to solve new problems based in previous experience. Main assumption in CBR relies in the hypothesis that similar problems should have similar solutions. CBR systems retrieve the most similar cases or experiences among those stored in the Case Base. Then, previous solutions given to these most similar past-solved cases can be adapted to fit new solutions for new cases or problems in a particular domain, instead of derive them from scratch. Thus, similarity measures are key elements in obtaining reliable similar cases, which will be used to derive solutions for new cases. This paper describes a comparative analysis of several commonly used similarity measures, including a measure previously developed by the authors, and a study on its performance in the CBR retrieval step for feature-vector case representations. The testing has been done using six-teen data sets from the UCI Machine Learning Database Repository, plus two complex environmental databases.Postprint (published version
Crossing the death valley to transfer environmental decision support systems to the water market
Environmental decision support systems (EDSSs) are attractive tools to cope with the complexity of environmental global challenges. Several thoughtful reviews have analyzed EDSSs to identify the key challenges and best practices for their development. One of the major criticisms is that a wide and generalized use of deployed EDSSs has not been observed. The paper briefly describes and compares four case studies of EDSSs applied to the water domain, where the key aspects involved in the initial conception and the use and transfer evolution that determine the final success or failure of these tools (i.e., market uptake) are identified. Those aspects that contribute to bridging the gap between the EDSS science and the EDSS market are highlighted in the manuscript. Experience suggests that the construction of a successful EDSS should focus significant efforts on crossing the death-valley toward a general use implementation by society (the market) rather than on development.The authors would like to thank the Catalan Water Agency (Agència Catalana de l’Aigua), Besòs River Basin Regional Administration
(Consorci per la Defensa de la Conca del Riu Besòs), SISLtech, and Spanish Ministry of Science and Innovation for providing funding
(CTM2012-38314-C02-01 and CTM2015-66892-R). LEQUIA, KEMLG, and
ICRA were recognized as consolidated research groups by the Catalan
Government under the codes 2014-SGR-1168, 2013-SGR-1304 and
2014-SGR-291.Peer ReviewedPostprint (published version
- …