The Minkowski central partition as a pointer to a suitable distance exponent and consensus partitioning
The Minkowski weighted K-means (MWK-means) is a recently developed clustering algorithm capable of computing feature weights. The cluster-specific weights in MWK-means follow the intuitive idea that a feature with low variance should have a greater weight than a feature with high variance. The final clustering found by this algorithm depends on the selection of the Minkowski distance exponent. This paper explores the possibility of using the central Minkowski partition in the ensemble of all Minkowski partitions for selecting an optimal value of the Minkowski exponent. The central Minkowski partition also appears to be a good consensus partition. Furthermore, we discovered some striking correlations between the Minkowski profile, defined as a mapping of the Minkowski exponent values into the average similarity values of the optimal Minkowski partitions, and the Adjusted Rand Index vectors resulting from the comparison of the obtained partitions to the ground truth. Our findings were confirmed by a series of computational experiments involving synthetic Gaussian clusters and real-world data.
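As a sketch of the mechanism described above, the following assumes the commonly cited MWK-means formulas for exponent p > 1: a feature-weighted Minkowski distance, and a weight update in which a feature's weight shrinks as its within-cluster dispersion grows. Function names are illustrative, not the authors' implementation.

```python
import numpy as np

def weighted_minkowski(x, y, w, p):
    """Weighted Minkowski distance: sum_v w_v**p * |x_v - y_v|**p."""
    return np.sum((w ** p) * np.abs(x - y) ** p)

def mwk_weights(cluster_points, centroid, p):
    """Dispersion-based feature weights for one cluster (p > 1).

    D_v is the Minkowski dispersion of feature v around the centroid;
    low-dispersion features get larger weights, and the weights of a
    cluster sum to 1.
    """
    D = np.sum(np.abs(cluster_points - centroid) ** p, axis=0)
    D = D + 1e-12  # guard against zero dispersion
    return np.array([1.0 / np.sum((D[v] / D) ** (1.0 / (p - 1)))
                     for v in range(len(D))])
```

For p = 2 the update reduces to weights proportional to the inverse dispersions, matching the intuition quoted in the abstract.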
Sparse p-Adic Data Coding for Computationally Efficient and Effective Big Data Analytics
We develop the theory and practical implementation of p-adic sparse coding of data. Rather than the standard sparsifying criterion that uses the pseudo-norm, we use the p-adic norm. We require that the hierarchy or tree be node-ranked, as is standard practice in agglomerative and other hierarchical clustering, but not necessarily with decision trees. In order to structure the data, all computational processing operations are direct readings of the data, or are bounded by a constant number of direct readings of the data, implying linear computational time. Through p-adic sparse data coding, efficient storage results, and for data stored with bounded p-adic norm, search and retrieval are constant-time operations. Examples show the effectiveness of this new approach to content-driven encoding and displaying of data.
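For readers unfamiliar with the p-adic norm invoked above, here is a minimal illustration on integers (the paper's coding of hierarchies is more involved; this shows only the norm itself and its ultrametric flavour):

```python
def padic_valuation(n, p):
    """Largest k such that p**k divides n; the valuation of 0 is infinite."""
    if n == 0:
        return float("inf")
    k = 0
    while n % p == 0:
        n //= p
        k += 1
    return k

def padic_norm(n, p):
    """p-adic norm |n|_p = p**(-v_p(n)), with |0|_p = 0.

    Numbers divisible by high powers of p are p-adically *small*,
    which is what makes the norm useful as a sparsity criterion.
    """
    if n == 0:
        return 0.0
    return float(p) ** (-padic_valuation(n, p))
```

The norm satisfies the strong (ultrametric) triangle inequality |a + b|_p <= max(|a|_p, |b|_p), the property that ties it to node-ranked trees.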
Community detection in graphs
The modern science of networks has brought significant advances to our understanding of complex systems. One of the most relevant features of graphs representing real systems is community structure, or clustering, i.e. the organization of vertices in clusters, with many edges joining vertices of the same cluster and comparatively few edges joining vertices of different clusters. Such clusters, or communities, can be considered as fairly independent compartments of a graph, playing a role similar to that of, e.g., the tissues or the organs in the human body. Detecting communities is of great importance in sociology, biology and computer science, disciplines where systems are often represented as graphs. This problem is very hard and not yet satisfactorily solved, despite the huge effort of a large interdisciplinary community of scientists working on it over the past few years. We will attempt a thorough exposition of the topic, from the definition of the main elements of the problem, to the presentation of most methods developed, with a special focus on techniques designed by statistical physicists, from the discussion of crucial issues like the significance of clustering and how methods should be tested and compared against each other, to the description of applications to real networks. (Review article, 103 pages, 42 figures, 2 tables. Final version published in Physics Reports.)
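The quality function most closely associated with the statistical-physics techniques surveyed in this review is Newman-Girvan modularity, which scores a partition by comparing intra-community edges against a degree-preserving random graph. A minimal sketch (the function name and dense-matrix representation are illustrative):

```python
import numpy as np

def modularity(A, communities):
    """Newman-Girvan modularity Q of a vertex partition.

    A: symmetric adjacency matrix of an undirected graph.
    communities: community label per vertex.
    Q = (1/2m) * sum_ij (A_ij - k_i*k_j/2m) * [c_i == c_j]
    """
    A = np.asarray(A, dtype=float)
    k = A.sum(axis=1)        # vertex degrees
    two_m = A.sum()          # twice the number of edges
    labels = np.asarray(communities)
    same = labels[:, None] == labels[None, :]
    return np.sum((A - np.outer(k, k) / two_m) * same) / two_m
```

Larger Q means more edges fall inside communities than expected at random; many of the methods discussed in the review maximize this quantity, directly or via physical analogies such as spin models.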
RecG directs DNA synthesis during double-strand break repair
Homologous recombination provides a mechanism of DNA double-strand break repair (DSBR) that requires an intact, homologous template for DNA synthesis. When DNA synthesis associated with DSBR is convergent, the broken DNA strands are replaced and repair is accurate. However, if divergent DNA synthesis is established, over-replication of flanking DNA may occur with deleterious consequences. The RecG protein of Escherichia coli is a helicase and translocase that can re-model 3-way and 4-way DNA structures such as replication forks and Holliday junctions. However, the primary role of RecG in live cells has remained elusive. Here we show that, in the absence of RecG, attempted DSBR is accompanied by divergent DNA replication at the site of an induced chromosomal DNA double-strand break. Furthermore, DNA double-strand ends are generated in a recG mutant at sites known to block replication forks. These double-strand ends also trigger DSBR and the divergent DNA replication characteristic of this mutant, which can explain over-replication of the terminus region of the chromosome. The loss of DNA associated with unwinding joint molecules, previously observed in the absence of RuvAB and RecG, is suppressed by a helicase-deficient PriA mutation (priA300), arguing that the action of RecG ensures that PriA is bound correctly on D-loops to direct DNA replication rather than to unwind joint molecules. This has led us to put forward a revised model of homologous recombination in which the re-modelling of branched intermediates by RecG plays a fundamental role in directing DNA synthesis and thus maintaining genomic stability.
Laplacian normalization for deriving thematic fuzzy clusters with an additive spectral approach
This paper presents a further investigation into computational properties of a novel fuzzy additive spectral clustering method, Fuzzy Additive Spectral clustering (FADDIS), recently introduced by the authors. Specifically, we extend our analysis to "difficult" data structures from the recent literature and develop two synthetic data generators simulating affinity data of Gaussian clusters and genuine additive similarity data, with a controlled level of noise. FADDIS is experimentally verified on these data in comparison with two state-of-the-art fuzzy clustering methods. The claimed ability of FADDIS to help in determining the right number of clusters is experimentally tested, and the role of the pseudo-inverse Laplacian data transformation in this is highlighted. A potentially useful extension of the method to biclustering is introduced.
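The pseudo-inverse Laplacian transformation highlighted above can be sketched as follows. This assumes the combinatorial Laplacian L = D - W of a symmetric similarity matrix W; FADDIS may apply a normalized variant, so treat this as an illustration of the transformation, not the method itself.

```python
import numpy as np

def pseudo_inverse_laplacian(W):
    """Moore-Penrose pseudo-inverse of the combinatorial Laplacian.

    W: symmetric similarity (affinity) matrix.
    L = D - W is singular (its rows sum to zero), so the ordinary
    inverse does not exist; the pseudo-inverse plays its role and
    tends to sharpen cluster structure in the transformed similarities.
    """
    W = np.asarray(W, dtype=float)
    L = np.diag(W.sum(axis=1)) - W
    return np.linalg.pinv(L)
```

The transformed matrix is then fed to the additive spectral clustering step in place of the raw similarities.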
A hybrid cluster-lift method for the analysis of research activities
A hybrid of two novel methods - additive fuzzy spectral clustering and lifting method over a taxonomy - is applied to analyse the research activities of a department. To be specific, we concentrate on the Computer Sciences area represented by the ACM Computing Classification System (ACM-CCS), but the approach is applicable also to other taxonomies. Clusters of the taxonomy subjects are extracted using an original additive spectral clustering method involving a number of model-based stopping conditions. The clusters are then parsimoniously lifted to higher ranks of the taxonomy by minimizing the count of "head subjects" along with their "gaps" and "offshoots". An example is given illustrating the method applied to real-world data.
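A toy sketch of the lifting trade-off described above: raising a cluster to a single higher-rank "head subject" saves heads but pays penalties for "gaps" (leaves under the head that are not in the cluster) and "offshoots" (cluster leaves the head does not cover). The tree encoding, penalty values, and function name are all illustrative assumptions, not the authors' formulation.

```python
def lift_cost(head, cluster, children, gap_pen=0.5, off_pen=0.9):
    """Cost of lifting `cluster` (a set of taxonomy leaves) to one `head`.

    children: dict mapping each internal taxonomy node to its child nodes;
    nodes absent from the dict are leaves.  Cost = 1 head + penalties
    for gaps and offshoots (penalty weights are illustrative).
    """
    def leaves(node):
        if node not in children:
            return {node}
        out = set()
        for ch in children[node]:
            out |= leaves(ch)
        return out

    covered = leaves(head)
    gaps = covered - cluster
    offshoots = cluster - covered
    return 1 + gap_pen * len(gaps) + off_pen * len(offshoots)
```

Comparing this cost across candidate head nodes reproduces the parsimony principle: a high-rank head wins only when the gaps it introduces are cheaper than the offshoots it avoids.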
- …