Search CORE

7,272 research outputs found

Adaptive Online Sequential ELM for Concept Drift Tackling

Author: Basaruddin Chan
Budiman Arif
Fanany Mohamad Ivan
Publication venue: 'Hindawi Limited'
Publication date: 01/01/2016
Field of study

A machine learning method needs to adapt to over time changes in the environment. Such changes are known as concept drift. In this paper, we propose concept drift tackling method as an enhancement of Online Sequential Extreme Learning Machine (OS-ELM) and Constructive Enhancement OS-ELM (CEOS-ELM) by adding adaptive capability for classification and regression problem. The scheme is named as adaptive OS-ELM (AOS-ELM). It is a single classifier scheme that works well to handle real drift, virtual drift, and hybrid drift. The AOS-ELM also works well for sudden drift and recurrent context change type. The scheme is a simple unified method implemented in simple lines of code. We evaluated AOS-ELM on regression and classification problem by using concept drift public data set (SEA and STAGGER) and other public data sets such as MNIST, USPS, and IDS. Experiments show that our method gives higher kappa value compared to the multiclassifier ELM ensemble. Even though AOS-ELM in practice does not need hidden nodes increase, we address some issues related to the increasing of the hidden nodes such as error condition and rank values. We propose taking the rank of the pseudoinverse matrix as an indicator parameter to detect underfitting condition.Comment: Hindawi Publishing. Computational Intelligence and Neuroscience Volume 2016 (2016), Article ID 8091267, 17 pages Received 29 January 2016, Accepted 17 May 2016. Special Issue on "Advances in Neural Networks and Hybrid-Metaheuristics: Theory, Algorithms, and Novel Engineering Applications". Academic Editor: Stefan Hauf

arXiv.org e-Print Archive

Crossref

Directory of Open Access Journals

PubMed Central

Mining Frequent Item sets in Data Streams

Author: Dass Rajanish
Publication venue
Publication date
Field of study

Research Papers in Economics

Computational framework to analyze agrometeorological, climate and remote sensing data: challenges and perspectives.

Author: AVILA A. M. H.
RODRIGUES JR. J. F.
ROMANI L. A. S.
SOUSA E. P. M. de
TRAINA JÚNIOR. C.
TRAINA A. J. M.
ZULLO JÚNIOR J.
Publication venue
Publication date: 31/01/2020
Field of study

In the past few years, improvements in the data acquisition technology have decreased the time interval of data gathering. Consequently, institutions have stored huge amounts of data such as climate time series and remote sensing images. Computational models to filter, transform, merge and analyze data from many different areas are complex and challenging. The complexity increases even more when combining several knowledge domains. Examples are research in climatic changes, biofuel production and environmental problems. A possible solution to the problem is the association of several computational techniques. Accordingly, this paper presents a framework to analyze, monitor and visualize climate and remote sensing data by employing methods based on fractal theory, data mining and visualization techniques. Initial experiments showed that the information and knowledge discovered from this framework can be employed to monitor sugar cane crops, helping agricultural entrepreneurs to make decisions in order to become more productive. Sugar cane is the main source to ethanol production in Brazil, and has a strategic importance for the country economy and to guarantee the Brazilian self-sufficiency in this important, renewable source of energy.CSBC 2009

Repository Open Access to Scientific Information from Embrapa

Detecting early signs of depressive and manic episodes in patients with bipolar disorder using the signature-based model

Author: Geddes John R.
Harrison Paul J.
Kormilitzin Andrey
Lyons Terry
Saunders Kate E. A.
Publication venue
Publication date: 01/01/2017
Field of study

Recurrent major mood episodes and subsyndromal mood instability cause substantial disability in patients with bipolar disorder. Early identification of mood episodes enabling timely mood stabilisation is an important clinical goal. Recent technological advances allow the prospective reporting of mood in real time enabling more accurate, efficient data capture. The complex nature of these data streams in combination with challenge of deriving meaning from missing data mean pose a significant analytic challenge. The signature method is derived from stochastic analysis and has the ability to capture important properties of complex ordered time series data. To explore whether the onset of episodes of mania and depression can be identified using self-reported mood data.Comment: 12 pages, 3 tables, 10 figure

arXiv.org e-Print Archive

Oxford University Research Archive

Reservoir of Diverse Adaptive Learners and Stacking Fast Hoeffding Drift Detection Methods for Evolving Data Streams

Author: Paquet Eric
Pesaranghader Ali
Viktor Herna
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 07/09/2017
Field of study

The last decade has seen a surge of interest in adaptive learning algorithms for data stream classification, with applications ranging from predicting ozone level peaks, learning stock market indicators, to detecting computer security violations. In addition, a number of methods have been developed to detect concept drifts in these streams. Consider a scenario where we have a number of classifiers with diverse learning styles and different drift detectors. Intuitively, the current 'best' (classifier, detector) pair is application dependent and may change as a result of the stream evolution. Our research builds on this observation. We introduce the \mbox{Tornado} framework that implements a reservoir of diverse classifiers, together with a variety of drift detection algorithms. In our framework, all (classifier, detector) pairs proceed, in parallel, to construct models against the evolving data streams. At any point in time, we select the pair which currently yields the best performance. We further incorporate two novel stacking-based drift detection methods, namely the \mbox{FHDDMS} and \mbox{FHDDMS}_{add} approaches. The experimental evaluation confirms that the current 'best' (classifier, detector) pair is not only heavily dependent on the characteristics of the stream, but also that this selection evolves as the stream flows. Further, our \mbox{FHDDMS} variants detect concept drifts accurately in a timely fashion while outperforming the state-of-the-art.Comment: 42 pages, and 14 figure

arXiv.org e-Print Archive

NRC Publications Archive

Sequential pattern mining with uncertain data

Author: Ge Jiaqi
Publication venue: 'Purdue University (bepress)'
Publication date: 01/01/2016
Field of study

In recent years, a number of emerging applications, such as sensor monitoring systems, RFID networks and location based services, have led to the proliferation of uncertain data. However, traditional data mining algorithms are usually inapplicable in uncertain data because of its probabilistic nature. Uncertainty has to be carefully handled; otherwise, it might significantly downgrade the quality of underlying data mining applications. Therefore, we extend traditional data mining algorithms into their uncertain versions so that they still can produce accurate results. In particular, we use a motivating example of sequential pattern mining to illustrate how to incorporate uncertain information in the process of data mining. We use possible world semantics to interpret two typical types of uncertainty: the tuple-level existential uncertainty and the attribute-level temporal uncertainty. In an uncertain database, it is probabilistic that a pattern is frequent or not; thus, we define the concept of probabilistic frequent sequential patterns. And various algorithms are designed to mine probabilistic frequent patterns efficiently in uncertain databases. We also implement our algorithms on distributed computing platforms, such as MapReduce and Spark, so that they can be applied in large scale databases. Our work also includes uncertainty computation in supervised machine learning algorithms. We develop an artificial neural network to classify numeric uncertain data; and a Naive Bayesian classifier is designed for classifying categorical uncertain data streams. We also propose a discretization algorithm to pre-process numerical uncertain data, since many classifiers work with categoric data only. And experimental results in both synthetic and real-world uncertain datasets demonstrate that our methods are effective and efficient

Purdue E-Pubs

AI for BIM-Based Sustainable Building Design:integrating knowledge discovery and semantic data modelling for evidence-based design decision support

Author: Petrova Ekaterina Aleksandrova
Publication venue: Aalborg Universitetsforlag
Publication date: 01/01/2019
Field of study

VBN