Search CORE

4,684 research outputs found

Evolving Ensemble Fuzzy Classifier

Author: Lughofer Edwin
Pedrycz Witold
Pratama Mahardhika
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2018
Field of study

The concept of ensemble learning offers a promising avenue in learning from data streams under complex environments because it addresses the bias and variance dilemma better than its single model counterpart and features a reconfigurable structure, which is well suited to the given context. While various extensions of ensemble learning for mining non-stationary data streams can be found in the literature, most of them are crafted under a static base classifier and revisits preceding samples in the sliding window for a retraining step. This feature causes computationally prohibitive complexity and is not flexible enough to cope with rapidly changing environments. Their complexities are often demanding because it involves a large collection of offline classifiers due to the absence of structural complexities reduction mechanisms and lack of an online feature selection mechanism. A novel evolving ensemble classifier, namely Parsimonious Ensemble pENsemble, is proposed in this paper. pENsemble differs from existing architectures in the fact that it is built upon an evolving classifier from data streams, termed Parsimonious Classifier pClass. pENsemble is equipped by an ensemble pruning mechanism, which estimates a localized generalization error of a base classifier. A dynamic online feature selection scenario is integrated into the pENsemble. This method allows for dynamic selection and deselection of input features on the fly. pENsemble adopts a dynamic ensemble structure to output a final classification decision where it features a novel drift detection scenario to grow the ensemble structure. The efficacy of the pENsemble has been numerically demonstrated through rigorous numerical studies with dynamic and evolving data streams where it delivers the most encouraging performance in attaining a tradeoff between accuracy and complexity.Comment: this paper has been published by IEEE Transactions on Fuzzy System

arXiv.org e-Print Archive

Role of Intellectual Peoperty Rights in the Benefit Sharing Arrangements: The Case of Bio-resources Development and Conservation Program in Nigeria

Author: Gupta Anil K.
Publication venue
Publication date
Field of study

The subject of this case study is the role of intellectual property rights in the benefit-sharing arrangements surrounding the work of the Bio-resources Development and Conservation Programme (BDCP) as a part of the International Cooperative Biodiversity Group (ICBG) in the field of traditional medicine. In particular the role of patents, trade secrets and trademarks are discussed. The case examines, inter alia, a national patent and an "international" patent application under the Patent Cooperation Treaty (PCT), with claims over TK-based pharmaceutical inventions related to the work of the ICBG. Copies of these patents are attached in Annexes 3.4.3 and 3.4.4. Based on these examples, the availability of patent protection is identified as a key requisite for generating benefits to be shared with local practitioners of traditional medicine from pharmaceutical research based on their knowledge. The central role of a Trust Fund established by BDCP for sharing these benefits in monetary and non-monetary form is highlighted. The case study also illustrates the difficulty of balancing the input of various local stakeholders of TK and biological resources, such as traditional healers associations vis-�-vis local community representatives. This is a part of WIPO sponsored study on the role of intellectual property rights in the sharing of benefits arising from the use of biological resources and associated traditional knowledge.

Evaluation methods and decision theory for classification of streaming data with temporal dependence

Author: Bifet Albert
Holmes Geoffrey
Pfahringer Bernhard
Read Jesse
Žliobaitė Indrė
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2014
Field of study

Predictive modeling on data streams plays an important role in modern data analysis, where data arrives continuously and needs to be mined in real time. In the stream setting the data distribution is often evolving over time, and models that update themselves during operation are becoming the state-of-the-art. This paper formalizes a learning and evaluation scheme of such predictive models. We theoretically analyze evaluation of classifiers on streaming data with temporal dependence. Our findings suggest that the commonly accepted data stream classification measures, such as classification accuracy and Kappa statistic, fail to diagnose cases of poor performance when temporal dependence is present, therefore they should not be used as sole performance indicators. Moreover, classification accuracy can be misleading if used as a proxy for evaluating change detectors with datasets that have temporal dependence. We formulate the decision theory for streaming data classification with temporal dependence and develop a new evaluation methodology for data stream classification that takes temporal dependence into account. We propose a combined measure for classification performance, that takes into account temporal dependence, and we recommend using it as the main performance measure in classification of streaming data

White learning methodology: a case study of cancer-related disease factors analysis in real-time PACS environment

Author: Fong S.
Fong S.
Li T.
Li T.
Liu L.
Liu L.
Mohammed S.
Mohammed S.
Siu S.
Siu S.
Yang X.
Yang X.
Publication venue: Elsevier Science
Publication date: 01/01/2020
Field of study

Bayesian network is a probabilistic model of which the prediction accuracy may not be one of the highest in the machine learning family. Deep learning (DL) on the other hand possess of higher predictive power than many other models. How reliable the result is, how it is deduced, how interpretable the prediction by DL mean to users, remain obscure. DL functions like a black box. As a result, many medical practitioners are reductant to use deep learning as the only tool for critical machine learning application, such as aiding tool for cancer diagnosis. In this paper, a framework of white learning is being proposed which takes advantages of both black box learning and white box learning. Usually, black box learning will give a high standard of accuracy and white box learning will provide an explainable direct acyclic graph. According to our design, there are 3 stages of White Learning, loosely coupled WL, semi coupled WL and tightly coupled WL based on degree of fusion of the white box learning and black box learning. In our design, a case of loosely coupled WL is tested on breast cancer dataset. This approach uses deep learning and an incremental version of Naïve Bayes network. White learning is largely defied as a systemic fusion of machine learning models which result in an explainable Bayes network which could find out the hidden relations between features and class and deep learning which would give a higher accuracy of prediction than other algorithms. We designed a series of experiments for this loosely coupled WL model. The simulation results show that using WL compared to standard black-box deep learning, the levels of accuracy and kappa statistics could be enhanced up to 50%. The performance of WL seems more stable too in extreme conditions such as noise and high dimensional data. The relations by Bayesian network of WL are more concise and stronger in affinity too. The experiments results deliver positive signals that WL is possible to output both high classification accuracy and explainable relations graph between features and class. [Abstract copyright: Copyright © 2020. Published by Elsevier B.V.