10 research outputs found

clues: An R Package for Nonparametric Clustering Based on Local Shrinking

Author: Fang Chang
Ross Lazarus
Ruben H. Zamar
Weiliang Qiu
Xiaogang Wang

Determining the optimal number of clusters appears to be a persistent and controversial issue in cluster analysis. Most existing R packages targeting clustering require the user to specify the number of clusters in advance. However, if this subjectively chosen number is far from optimal, clustering may produce seriously misleading results. In order to address this vexing problem, we develop the R package clues to automate and evaluate the selection of an optimal number of clusters, which is widely applicable in the field of clustering analysis. Package clues uses two main procedures, shrinking and partitioning, to estimate an optimal number of clusters by maximizing an index function, either the CH index or the Silhouette index, rather than relying on guessing a pre-specified number. Five agreement indices (Rand index, Hubert and Arabie's adjusted Rand index, Morey and Agresti's adjusted Rand index, Fowlkes and Mallows index and Jaccard index), which measure the degree of agreement between any two partitions, are also provided in clues. In addition to numerical evidence, clues also supplies a deeper insight into the partitioning process with trajectory plots.
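
As a rough illustration of what the agreement indices named in this abstract measure, the following Python sketch computes three of them (Rand, Jaccard, and Fowlkes and Mallows) from pair counts. It is not the clues package's code or API: the function names are hypothetical, and the handling of empty denominators is an assumed convention.

```python
from itertools import combinations
from math import sqrt


def pair_counts(labels_a, labels_b):
    """Count item pairs by whether each partition puts them in the same cluster."""
    if len(labels_a) != len(labels_b):
        raise ValueError("both partitions must label the same items")
    if len(labels_a) < 2:
        raise ValueError("need at least two items to form a pair")
    same_both = same_a_only = same_b_only = same_neither = 0
    for i, j in combinations(range(len(labels_a)), 2):
        same_a = labels_a[i] == labels_a[j]
        same_b = labels_b[i] == labels_b[j]
        if same_a and same_b:
            same_both += 1
        elif same_a:
            same_a_only += 1
        elif same_b:
            same_b_only += 1
        else:
            same_neither += 1
    return same_both, same_a_only, same_b_only, same_neither


def rand_index(labels_a, labels_b):
    """Fraction of pairs on which the two partitions agree."""
    n11, n10, n01, n00 = pair_counts(labels_a, labels_b)
    return (n11 + n00) / (n11 + n10 + n01 + n00)


def jaccard_index(labels_a, labels_b):
    """Like the Rand index, but ignoring pairs separated in both partitions."""
    n11, n10, n01, _ = pair_counts(labels_a, labels_b)
    denom = n11 + n10 + n01
    # Assumed convention: if no pair is co-clustered in either partition,
    # report perfect agreement.
    return n11 / denom if denom else 1.0


def fowlkes_mallows_index(labels_a, labels_b):
    """Geometric mean of pairwise precision and recall."""
    n11, n10, n01, _ = pair_counts(labels_a, labels_b)
    denom = sqrt((n11 + n10) * (n11 + n01))
    # Assumed convention: return 0.0 when either partition has no co-clustered pair.
    return n11 / denom if denom else 0.0
```

For example, `rand_index([0, 0, 1, 1], [1, 1, 0, 0])` is 1.0, because the indices depend only on which items are grouped together, not on the label values. The two adjusted Rand indices mentioned in the abstract additionally correct for chance agreement and are not sketched here.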

Research Papers in Economics

clues: An R Package for Nonparametric Clustering Based on Local Shrinking

Author: Chang Fang
Lazarus Ross
Qiu Weiliang
Wang Xiaogang
Zamar Ruben H.
Publication venue: 'Foundation for Open Access Statistics'
Publication date: 01/01/2010

Determining the optimal number of clusters appears to be a persistent and controversial issue in cluster analysis. Most existing R packages targeting clustering require the user to specify the number of clusters in advance. However, if this subjectively chosen number is far from optimal, clustering may produce seriously misleading results. In order to address this vexing problem, we develop the R package clues to automate and evaluate the selection of an optimal number of clusters, which is widely applicable in the field of clustering analysis. Package clues uses two main procedures, shrinking and partitioning, to estimate an optimal number of clusters by maximizing an index function, either the CH index or the Silhouette index, rather than relying on guessing a pre-specified number. Five agreement indices (Rand index, Hubert and Arabie's adjusted Rand index, Morey and Agresti's adjusted Rand index, Fowlkes and Mallows index and Jaccard index), which measure the degree of agreement between any two partitions, are also provided in clues. In addition to numerical evidence, clues also supplies a deeper insight into the partitioning process with trajectory plots.

Crossref

Directory of Open Access Journals

Journal of Statistical Software

Boosting video tracking performance by means of Tabu Search in Intelligent Visual Surveillance Systems

Author: Berlanga de Jesús Antonio
García Jesús
Ivan Dotu
Molina López José Manuel
Patricio Guisado Miguel Ángel
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2011

In this paper, we present a fast and efficient technique for the data association problem applied to visual tracking systems. The visual tracking process is formulated as a combinatorial hypotheses search with a heuristic evaluation function taking into account structural and specific information such as distance, shape, color, etc. We introduce a Tabu Search algorithm which performs a search on an indirect space. A novel problem formulation allows us to transform any solution into the real search space, which is needed for fitness calculation, in linear time. This new formulation and the use of auxiliary structures yield a fast transformation from a blob-to-track assignment space to the real shape and position of tracks space (while calculating fitness in an incremental fashion), which is key in order to produce efficient and fast results. Other previous approaches are based on statistical techniques or on evolutionary algorithms. These techniques are quite efficient and robust although they cannot converge as fast as our approach. This work was supported in part by Projects CICYT TIN2008-06742-C02-02/TSI, CICYT TEC2008-06732-C02-02/TEC, CAM CONTEXTS (S2009/TIC-1485) and DPS2008-07029-C02-02. Publicad

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Universidad Carlos III de Madrid e-Archivo

Unsupervised Spike Sorting for Large-Scale, High-Density Multielectrode Arrays

Author: Berdondini L.
Cella Zanacchi F.
Hennig M. H.
Hilgen G.
Kepiro I. E.
Maccione A.
Murino V.
Muthmann J. -O.
Pirmoradian S.
Puente Encinas A.
Ramirez C. J.
Sernagor E.
Sona D.
Sorbaro M.
Ullo S.
Publication venue: 'Elsevier BV'
Publication date: 01/01/2017

We present a method for automated spike sorting for recordings with high-density, large-scale multielectrode arrays. Exploiting the dense sampling of single neurons by multiple electrodes, an efficient, low-dimensional representation of detected spikes consisting of estimated spatial spike locations and dominant spike shape features is exploited for fast and reliable clustering into single units. Millions of events can be sorted in minutes, and the method is parallelized and scales better than quadratically with the number of detected spikes. Performance is demonstrated using recordings with a 4,096-channel array and validated using anatomical imaging, optogenetic stimulation, and model-based quality control. A comparison with semi-automated, shape-based spike sorting exposes significant limitations of conventional methods. Our approach demonstrates that it is feasible to reliably isolate the activity of up to thousands of neurons and that dense, multi-channel probes substantially aid reliable spike sorting.

Archivio della Ricerca - Università di Pisa

Fuzzy-Rough Intrigued Harmonic Discrepancy Clustering

Author: Chao Fei
Deng Ansheng
Qu Yanpeng
Shang Changjing
Shen Qiang
Yang Longzhi
Yue Guanli
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 22/02/2023

Aberystwyth Research Portal

A dynamic key frames approach to object tracking

Author: Wilkens Christopher A
Publication venue: Massachusetts Institute of Technology
Publication date: 01/01/2008

Thesis (M. Eng.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2008. This electronic version was submitted by the student author. The certified thesis is available in the Institute Archives and Special Collections. Includes bibliographical references (p. 95-96). In this thesis, I present a dynamic key frames algorithm for state estimation from observations. The algorithm uses KL-divergence as a metric to identify the frames that contribute the most information to estimation of the system's current state. The algorithm is first presented in a numerical optimization framework and then developed as an extension to the Condensation algorithm. Finally, I present results from a Matlab simulation of the algorithm. by Christopher A. Wilkens. M.Eng.
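
For readers unfamiliar with the KL-divergence mentioned in this abstract, here is a minimal Python sketch of the quantity for discrete distributions. It only illustrates the measure itself; it is not the thesis's key-frame selection algorithm, and the function name and the list-based representation are assumptions made for the example.

```python
import math


def kl_divergence(p, q):
    """Return D_KL(p || q) in nats for two discrete distributions.

    p and q are equal-length sequences of probabilities, each summing to 1.
    """
    if len(p) != len(q):
        raise ValueError("p and q must have the same number of outcomes")
    total = 0.0
    for p_i, q_i in zip(p, q):
        if p_i == 0:
            continue  # terms with p_i = 0 contribute nothing by convention
        if q_i == 0:
            return math.inf  # p puts mass where q has none
        total += p_i * math.log(p_i / q_i)
    return total
```

The divergence is zero when the two distributions are identical and grows as `p` concentrates mass where `q` assigns little. It is also asymmetric: `kl_divergence(p, q)` and `kl_divergence(q, p)` generally differ.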

DSpace@MIT

Hierarchical Visual Content Modelling and Query based on Trees

Author: Setyanto Arief
Publication date: 01/01/2016

In recent years, such vast archives of video information have become available that human annotation of content is no longer feasible; automation of video content analysis is therefore highly desirable. The recognition of semantic content in images is a problem that relies on prior knowledge and learnt information and that, to date, has only been partially solved. Salient analysis, on the other hand, is statistically based and highlights regions that are distinct from their surroundings, while also being scalable and repeatable. The arrangement of salient information into hierarchical tree structures in the spatial and temporal domains forms an important step to bridge the semantic salient gap. Salient regions are identified using region analysis, rank ordered and documented in a tree for further analysis. A structure of this kind contains all the information in the original video and forms an intermediary between video processing and video understanding, transforming video analysis to a syntactic database analysis problem. This contribution demonstrates the formulation of spatio-temporal salient trees, the syntax to index them, and provides an interface for higher level cognition in machine vision.

University of Essex Research Repository

Diposit Digital de Documents de la UAB

An Implementation of the Mean Shift Algorithm

Publication venue: 'Image Processing On Line'

Crossref