Search CORE

5,877 research outputs found

Noise resistant generalized parametric validity index of clustering for gene expression data

Author: Fa R
Nandi AK
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/07/2014
Field of study

This article has been made available through the Brunel Open Access Publishing Fund.Validity indices have been investigated for decades. However, since there is no study of noise-resistance performance of these indices in the literature, there is no guideline for determining the best clustering in noisy data sets, especially microarray data sets. In this paper, we propose a generalized parametric validity (GPV) index which employs two tunable parameters α and β to control the proportions of objects being considered to calculate the dissimilarities. The greatest advantage of the proposed GPV index is its noise-resistance ability, which results from the flexibility of tuning the parameters. Several rules are set to guide the selection of parameter values. To illustrate the noise-resistance performance of the proposed index, we evaluate the GPV index for assessing five clustering algorithms in two gene expression data simulation models with different noise levels and compare the ability of determining the number of clusters with eight existing indices. We also test the GPV in three groups of real gene expression data sets. The experimental results suggest that the proposed GPV index has superior noise-resistance ability and provides fairly accurate judgements

Crossref

Brunel University Research Archive

Speaker segmentation and clustering

Author: Ajmera
Ajmera
Almpanidis
Barras
Bimbot
Campbell
Campbell
Cettolo
Constantine Kotropoulos
Delacourt
Deller
Fiscus
Gales
Garofolo
Godfrey
Graff
Graff
Graff
Hansen
Harb
Hess
Huang
Jain
Kim
Know
Lapidot
Lu
Manjunath
Margarita Kotti
Meignier
Oppenheim
Pellom
Reynolds
Sondhi
Tranter
Vassiliki Moschou
Ververidis
Wang
Wu
Wu
Zhou
Zhu
Publication venue: 'Elsevier BV'
Publication date: 01/01/2008
Field of study

This survey focuses on two challenging speech processing topics, namely: speaker segmentation and speaker clustering. Speaker segmentation aims at finding speaker change points in an audio stream, whereas speaker clustering aims at grouping speech segments based on speaker characteristics. Model-based, metric-based, and hybrid speaker segmentation algorithms are reviewed. Concerning speaker clustering, deterministic and probabilistic algorithms are examined. A comparative assessment of the reviewed algorithms is undertaken, the algorithm advantages and disadvantages are indicated, insight to the algorithms is offered, and deductions as well as recommendations are given. Rich transcription and movie analysis are candidate applications that benefit from combined speaker segmentation and clustering. © 2007 Elsevier B.V. All rights reserved

CiteSeerX

Crossref

Spiral - Imperial College Digital Repository

Electricity clustering framework for automatic classification of customer loads

Author: Biscarri Triviño Félix
García Delgado Antonio
Guerrero Alonso Juan Ignacio
León de Mora Carlos
Monedero Goicoechea Iñigo Luis
Publication venue: 'Elsevier BV'
Publication date: 01/01/2017
Field of study

Clustering in energy markets is a top topic with high significance on expert and intelligent systems. The main impact of is paper is the proposal of a new clustering framework for the automatic classification of electricity customers’ loads. An automatic selection of the clustering classification algorithm is also highlighted. Finally, new customers can be assigned to a predefined set of clusters in the classificationphase. The computation time of the proposed framework is less than that of previous classification tech- niques, which enables the processing of a complete electric company sample in a matter of minutes on a personal computer. The high accuracy of the predicted classification results verifies the performance of the clustering technique. This classification phase is of significant assistance in interpreting the results, and the simplicity of the clustering phase is sufficient to demonstrate the quality of the complete mining framework.Ministerio de Economía y Competitividad TEC2013-40767-RMinisterio de Economía y Competitividad IDI- 2015004

idUS. Depósito de Investigación Universidad de Sevilla

Hierarchical information clustering by means of topologically embedded graphs

Author: Aste Tomaso
Matteo T. Di
Song Won-Min
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 10/12/2015
Field of study

We introduce a graph-theoretic approach to extract clusters and hierarchies in complex data-sets in an unsupervised and deterministic manner, without the use of any prior information. This is achieved by building topologically embedded networks containing the subset of most significant links and analyzing the network structure. For a planar embedding, this method provides both the intra-cluster hierarchy, which describes the way clusters are composed, and the inter-cluster hierarchy which describes how clusters gather together. We discuss performance, robustness and reliability of this method by first investigating several artificial data-sets, finding that it can outperform significantly other established approaches. Then we show that our method can successfully differentiate meaningful clusters and hierarchies in a variety of real data-sets. In particular, we find that the application to gene expression patterns of lymphoma samples uncovers biologically significant groups of genes which play key-roles in diagnosis, prognosis and treatment of some of the most relevant human lymphoid malignancies

The Australian National University

Relational visual cluster validity

Author: Ding Y.
Harrison R.F.
Publication venue: 'Elsevier BV'
Publication date: 01/11/2007
Field of study

The assessment of cluster validity plays a very important role in cluster analysis. Most commonly used cluster validity methods are based on statistical hypothesis testing or finding the best clustering scheme by computing a number of different cluster validity indices. A number of visual methods of cluster validity have been produced to display directly the validity of clusters by mapping data into two- or three-dimensional space. However, these methods may lose too much information to correctly estimate the results of clustering algorithms. Although the visual cluster validity (VCV) method of Hathaway and Bezdek can successfully solve this problem, it can only be applied for object data, i.e. feature measurements. There are very few validity methods that can be used to analyze the validity of data where only a similarity or dissimilarity relation exists – relational data. To tackle this problem, this paper presents a relational visual cluster validity (RVCV) method to assess the validity of clustering relational data. This is done by combining the results of the non-Euclidean relational fuzzy c-means (NERFCM) algorithm with a modification of the VCV method to produce a visual representation of cluster validity. RVCV can cluster complete and incomplete relational data and adds to the visual cluster validity theory. Numeric examples using synthetic and real data are presente

White Rose Research Online

Hierarchical information clustering by means of topologically embedded graphs

Author: A Alizadeh
A Jain
AI Saez
AJ Nathalie
BB Ding
C Rivera
D Arthur
D Garlaschelli
DL Davies
DM Rocke
G Caldarelli
G Lenz
G Ringel
G Romeo
GL Pellegrini
GP Coffey
H Hooyberghs
IS Lossos
IT Hernádvölgyi
J Dunn
J Handl
J McQueen
J Quackenbush
J Ruan
J Shi
J Wang
JM Boyer
JS Abramson
JSJ Andrade
KII Goh
L Amaral
L Chen
L Hubert
L Leseux
LL Lam
M Arsura
M Eisen
M Filipits
M Girvan
M Kitsak
M Tumminello
MC de Souto
N Wada
PF Jonsson
R Diestel
R Seki
R Xu
RA Fisher
S Fortunato
ShaunS Wang
SV Buldyrev
T Aste
T Di Matteo
T Di Matteo
T Di Matteo
T Kamijo
T Kohonen
T Sorensen
T. Di Matteo
Tomaso Aste
U von Luxburg
WM Song
Won-Min Song
X Zhao
XF Zhao
Ying Xu
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 20/10/2011
Field of study

arXiv.org e-Print Archive

Crossref

Directory of Open Access Journals

PubMed Central

Kent Academic Repository

King's Research Portal

FigShare