Search CORE

2,259 research outputs found

Recommended from our members

Learning Non-Homogenous Textures and the Unlearning Problem with Application to Drusen Detection in Retinal Images

Author: Laine Andrew F.
Lee Noah
Smith R. Theodore
Publication venue: 'Columbia University Libraries/Information Services'
Publication date: 01/01/2008
Field of study

In this work we present a novel approach for learning non- homogenous textures without facing the unlearning problem. Our learning method mimics the human behavior of selective learning in the sense of fast memory renewal. We perform probabilistic boosting and structural similarity clustering for fast selective learning in a large knowledge domain acquired over different time steps. Applied to non- homogenous texture discrimination, our learning method is the first approach that deals with the unlearning problem applied to the task of drusen segmentation in retinal imagery, which itself is a challenging problem due to high variability of non-homogenous texture appearance. We present preliminary results

Columbia University Academic Commons

On pruning and feature engineering in Random Forests.

Author: Fawagreh Khaled
Publication venue
Publication date: 31/10/2016
Field of study

Random Forest (RF) is an ensemble classification technique that was developed by Leo Breiman over a decade ago. Compared with other ensemble techniques, it has proved its accuracy and superiority. Many researchers, however, believe that there is still room for optimizing RF further by enhancing and improving its performance accuracy. This explains why there have been many extensions of RF where each extension employed a variety of techniques and strategies to improve certain aspect(s) of RF. The main focus of this dissertation is to develop new extensions of RF using new optimization techniques that, to the best of our knowledge, have never been used before to optimize RF. These techniques are clustering, the local outlier factor, diversified weighted subspaces, and replicator dynamics. Applying these techniques on RF produced four extensions which we have termed CLUB-DRF, LOFB-DRF, DSB-RF, and RDB-DR respectively. Experimental studies on 15 real datasets showed favorable results, demonstrating the potential of the proposed methods. Performance-wise, CLUB-DRF is ranked first in terms of accuracy and classifcation speed making it ideal for real-time applications, and for machines/devices with limited memory and processing power

Open Access Institutional Repository at Robert Gordon University

Deep Clustering: A Comprehensive Survey

Author: He Lifang
Li Guofeng
Pu Jingyu
Pu Xiaorong
Ren Yazhou
Xu Jie
Yang Zhimeng
Yu Philip S.
Publication venue
Publication date: 08/10/2022
Field of study

Cluster analysis plays an indispensable role in machine learning and data mining. Learning a good data representation is crucial for clustering algorithms. Recently, deep clustering, which can learn clustering-friendly representations using deep neural networks, has been broadly applied in a wide range of clustering tasks. Existing surveys for deep clustering mainly focus on the single-view fields and the network architectures, ignoring the complex application scenarios of clustering. To address this issue, in this paper we provide a comprehensive survey for deep clustering in views of data sources. With different data sources and initial conditions, we systematically distinguish the clustering methods in terms of methodology, prior knowledge, and architecture. Concretely, deep clustering methods are introduced according to four categories, i.e., traditional single-view deep clustering, semi-supervised deep clustering, deep multi-view clustering, and deep transfer clustering. Finally, we discuss the open challenges and potential future opportunities in different fields of deep clustering

arXiv.org e-Print Archive