17 research outputs found

Evaluation of Jackknife and Bootstrap for Defining Confidence Intervals for Pairwise Agreement Measures

Author: A Thalamuthu
AC Shore
AN Albatineh
Ana Severiano
B Efron
B Efron
B Mirkin
D. Ashley Robinson
DL Wallace
DS Smyth
EB Fowlkes
EP Smith
Fabio Rapallo
FR Pinto
FR Pinto
Francisco R. Pinto
G Cagney
GA Price
JA Carriço
JF Heltshe
JF Heltshe
JJ Hellmann
João A. Carriço
L Hubert
Mário Ramirez
NA Faria
NH Zaiss
P Jaccard
R Newson
S Zahl
W Smith
WM Rand
Publication venue: Public Library of Science
Publication date: 01/01/2011

Several research fields frequently deal with the analysis of diverse classification results of the same entities. This should imply an objective detection of overlaps and divergences between the formed clusters. The congruence between classifications can be quantified by clustering agreement measures, including pairwise agreement measures. Several measures have been proposed, and the importance of obtaining confidence intervals for the point estimate when comparing these measures has been highlighted. A broad range of methods can be used to estimate confidence intervals. However, evidence is lacking about which methods are appropriate for calculating confidence intervals for most clustering agreement measures. Here we evaluate the resampling techniques of bootstrap and jackknife for calculating confidence intervals for clustering agreement measures. Contrary to what has been shown for some statistics, simulations showed that the jackknife performs better than the bootstrap at accurately estimating confidence intervals for pairwise agreement measures, especially when the agreement between partitions is low. The coverage of the jackknife confidence interval is robust to changes in cluster number and cluster size distribution.
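To make the jackknife idea in this abstract concrete, here is a minimal sketch of a leave-one-out jackknife confidence interval for a pairwise agreement measure. It is an illustration under stated assumptions, not the authors' procedure: the plain Rand index stands in for the measures the paper evaluates, the interval uses a normal approximation on jackknife pseudo-values, and the names `pairwise_agreement` and `jackknife_ci` are hypothetical.

```python
from itertools import combinations
from statistics import NormalDist, mean, stdev


def pairwise_agreement(a, b):
    """Fraction of object pairs on which two partitions agree (plain Rand index)."""
    pairs = list(combinations(range(len(a)), 2))
    # A pair agrees when both partitions group it together or both split it.
    agree = sum((a[i] == a[j]) == (b[i] == b[j]) for i, j in pairs)
    return agree / len(pairs)


def jackknife_ci(a, b, level=0.95):
    """Leave-one-object-out jackknife estimate and normal-approximation interval.

    Assumption: pseudo-values are treated as approximately independent, which
    is the textbook jackknife recipe and may differ from the paper's details.
    """
    a, b = list(a), list(b)
    n = len(a)
    if n != len(b) or n < 3:
        raise ValueError("need two label lists of equal length >= 3")
    full = pairwise_agreement(a, b)
    pseudo = []
    for k in range(n):
        # Recompute the measure with object k removed from both partitions.
        reduced = pairwise_agreement(a[:k] + a[k + 1:], b[:k] + b[k + 1:])
        pseudo.append(n * full - (n - 1) * reduced)
    estimate = mean(pseudo)
    std_err = stdev(pseudo) / n ** 0.5
    z = NormalDist().inv_cdf(0.5 + level / 2)
    return estimate, estimate - z * std_err, estimate + z * std_err
```

Calling `jackknife_ci(labels_a, labels_b)` with two equal-length label lists returns the jackknife estimate followed by the lower and upper interval bounds. Nothing in this sketch clips the bounds to the range of the measure, so they can fall outside [0, 1].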

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

The stability of co-authorship structures

Author: A Abbasi
A Abbasi
A Ferligoj
A Ferligoj
A Hollis
AN Albatineh
Anuška Ferligoj
B Groboljšek
D Beaver
D Beaver
DJDS Price
DR White
F Lorrain
F Yoshikane
G Laudel
G Melin
J Haan De
J Howells
J Lundberg
J Moody
JD Adams
JS Katz
K Frenken
L Kronegger
L Kronegger
Luka Kronegger
Marjan Cugmas
NE Friedkin
P Doreian
R Lambiotte
RS Burt
S Kyvik
S Lee
SR Borrett
V Batagelj
V Batagelj
V Batagelj
W Glänzel
X Liu
Z Chinchilla-Rodríguez
Z Hu
Publication venue: Springer Science and Business Media LLC

Crossref

Explicit Agreement Extremes for a 2 × 2 Table with Given Marginals

Author: AN Albatineh
D Steinley
H Messatfa
José E. Chacón
L Hubert
LC Morey
MJ Brusco
MJ Warrens
Publication venue: Springer Science and Business Media LLC

Crossref

k-Means, Ward and Probabilistic Distance-Based Clustering Methods with Contiguity Constraint

Author: A Ben-Israel
A Ferligoj
A Młodak
A Młodak
A Młodak
A Petrucci
AN Albatineh
AN Albatineh
AN Albatineh
Andrzej Młodak
BB Singh
BS Everitt
CS Peirce
E Weiszfeld
GH Ball
HH Kelejian
HW Kuhn
J Kubacki
JC Bezdek
JC Dunn
JC Dunn
JH Ward
JP LeSage
K Wagstaff
L Hubert
M Pratesi
R Mojena
RR Sokal
T Józefowski
U Maulik
W Rand
W Wagner
Publication venue: Springer Science and Business Media LLC

Crossref

Controlling and Visualizing the Precision-Recall Tradeoff for External Performance Indices

Author: AK Jain
AN Albatineh
B Hanczar
C Drummond
CD Manning
G Govaert
M Buckland
Marina Sokolova
S Bergmann
S Busygin
S Datta
SC Madeira
Y Cheng
Publication venue: Springer Science and Business Media LLC
Publication date: 10/09/2018

In many machine learning problems, the performance of the results is measured by indices that often combine precision and recall. In this paper, we study the behavior of such indices as a function of the precision-recall tradeoff. We present a new tool for performance visualization and analysis, referred to as the tradeoff space, which plots the performance index as a function of the precision-recall tradeoff. We analyse the properties of this new space and show its advantages over the precision-recall space. Code related to this paper is available at: https://sites-google-com.ezproxy.universite-paris-saclay.fr/site/bhanczarhomepage/prerec

HAL Evry

Crossref

Understanding Malvestuto’s normalized mutual information

Author: AK Jain
AN Albatineh
AN Albatineh
C Hennig
CE Shannon
D Pfitzner
D Steinley
D Steinley
E Rendón
FB Baulieu
FM Malvestuto
GW Milligan
GW Milligan
JR Quinlan
L Fisher
L Kaufman
LJ Hubert
M Meilă
M Rezaei
MJ Warrens
MJ Warrens
MJ Warrens
NX Vinh
TO Kvalseth
V Kumar
WM Rand
Y Lei
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2020

Malvestuto’s version of the normalized mutual information is a well-known information-theoretic index for quantifying agreement between two partitions. To further our understanding of what information on agreement between the clusters the index may reflect, we study components of the index that contain information on individual clusters, using mathematical analysis and numerical examples. The indices for individual clusters provide useful information on what is going on with specific clusters.

Crossref

Proceedings - University of Groningen

University of Groningen

ARTS repository - University of Groningen

Dissertations of the University of Groningen

Correcting Jaccard and other similarity indices for chance agreement in cluster analysis

Author: Ahmed N. Albatineh
AK Jain
AN Albatineh
AN Albatineh
BB Lamont
D Steinley
DJ Rogers
DL Wallace
E Van Der Maarel
EB Fowlkes
EL Lehmann
G Milligan
G Milligan
HO Lancaster
J Czekanowski
JC Gower
L Hubert
L Morey
LR Dice
MA Fligner
Magdalena Niewiadomska-Bugaj
P Jaccard
P Jaccard
P Legendre
PC Saxena
PC Saxena
PF Russell
RR Sokal
RR Sokal
S Janson
S Kulczynski
SC Johnson
T Sørensen
TAB Snijders
TS Southwood
U Hamann
W Rand
Z Hubálek
Publication venue: Springer Science and Business Media LLC

Crossref

Understanding the Rand index

Author: A Severiano
AK Dubey
AK Jain
AN Albatineh
AN Albatineh
C Hennig
C Luo
D Pfitzner
D Steinley
D Steinley
DL Wallace
DT Anderson
EB Fowlkes
FB Baulieu
GW Milligan
GW Milligan
L Kaufman
LJ Hubert
M Brun
M Meilă
M Rezaei
MJ Warrens
MJ Warrens
MJ Warrens
MJ Warrens
MJ Warrens
MJ Warrens
NX Vinh
P Katiyar
RR Sokal
S Zeng
V Kumar
WJ Heiser
WM Rand
Z Huo
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2020

The Rand index continues to be one of the most popular indices for assessing agreement between two partitions. The Rand index combines two sources of information, object pairs put together, and object pairs assigned to different clusters, in both partitions. Via a decomposition of the Rand index into four asymmetric indices, we show that in many situations object pairs that were assigned to different clusters have considerable impact on the value of the overall Rand index.
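As a small illustration of the two sources of information this abstract mentions, the sketch below counts object pairs placed together in both partitions and pairs separated in both, then combines them into the standard Rand index. The function name `rand_index_parts` is hypothetical, and the paper's decomposition into four asymmetric indices is not reproduced here.

```python
from itertools import combinations


def rand_index_parts(labels_a, labels_b):
    """Return (rand_index, pairs_together_in_both, pairs_apart_in_both, total_pairs)."""
    if len(labels_a) != len(labels_b) or len(labels_a) < 2:
        raise ValueError("need two label sequences of equal length >= 2")
    together = apart = total = 0
    for i, j in combinations(range(len(labels_a)), 2):
        same_a = labels_a[i] == labels_a[j]
        same_b = labels_b[i] == labels_b[j]
        if same_a and same_b:
            together += 1   # pair grouped together by both partitions
        elif not same_a and not same_b:
            apart += 1      # pair separated by both partitions
        total += 1
    return (together + apart) / total, together, apart, total


ri, together, apart, total = rand_index_parts([0, 0, 1, 1], [0, 0, 1, 2])
print(ri, together, apart, total)  # 5 of 6 pairs agree: 1 together, 4 apart
```

In this toy example four of the five agreeing pairs are pairs kept apart, which matches the abstract's point that separated pairs can dominate the overall value.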

Crossref

Proceedings - University of Groningen

University of Groningen

ARTS repository - University of Groningen

Dissertations of the University of Groningen

Inequalities Between Similarities for Numerical Data

Author: AF ZUUR
AN ALBATINEH
BM CAMPBELL
FB BAULIEU
H Wolda
JC GOWER
JR BRAY
M-J LESOT
Matthijs J. Warrens
MJ WARRENS
MM DEZA
S Cha
U FECHNER
V BATAGELJ
V Huhta
Publication venue: Springer Science and Business Media LLC

Crossref

Adjusted Concordance Index: an Extension of the Adjusted Rand Index to Fuzzy Partitions

Author: A Ben-Israel
A Suleman
AK Jain
AK Jain
AN Albatineh
AN Albatineh
BS Duran
BS Everitt
D Stahl
DT Anderson
E Hüllermeier
EB Fowlkes
EH Ruspini
EP Klement
F Höppner
F Pesarin
F Pesarin
H Frigui
H Spath
HH Böck
J Han
JA Hartigan
JC Bezdek
JC Gower
L Hubert
L Hubert
L Kaufman
LC Morey
LR Dice
M Downton
M Meilă
MJ Warrens
MJ Warrens
MR Anderberg
P Jaccard
RJ Campello
RK Brouwer
S Kulczynski
WM Rand
Publication venue: Springer Science and Business Media LLC

Crossref