
17 research outputs found

Linear, Deterministic, and Order-Invariant Initialization Methods for the K-Means Clustering Algorithm

Over the past five decades, k-means has become the clustering algorithm of choice in many application domains primarily due to its simplicity, time/space efficiency, and invariance to the ordering of the data points. Unfortunately, the algorithm's sensitivity to the initial selection of the cluster centers remains its most serious drawback. Numerous initialization methods have been proposed to address this drawback. Many of these methods, however, have time complexity superlinear in the number of data points, which makes them impractical for large data sets. On the other hand, linear methods are often random and/or sensitive to the ordering of the data points. These methods are generally unreliable in that the quality of their results is unpredictable. Therefore, it is common practice to perform multiple runs of such methods and take the output of the run that produces the best results. Such a practice, however, greatly increases the computational requirements of the otherwise highly efficient k-means algorithm. In this chapter, we investigate the empirical performance of six linear, deterministic (non-random), and order-invariant k-means initialization methods on a large and diverse collection of data sets from the UCI Machine Learning Repository. The results demonstrate that two relatively unknown hierarchical initialization methods due to Su and Dy outperform the remaining four methods with respect to two objective effectiveness criteria. In addition, a recent method due to Erisoglu et al. performs surprisingly poorly. Comment: 21 pages, 2 figures, 5 tables, Partitional Clustering Algorithms (Springer, 2014). arXiv admin note: substantial text overlap with arXiv:1304.7465, arXiv:1209.196

arXiv.org e-Print Archive

Crossref

Bioavailable iron in the Southern Ocean: the significance of the iceberg conveyor belt

Author: AP Lisitzin
BM Loscher
C Hyacinthe
CP Slomp
D Lannuzel
DA Hutchins
DE Janney
G Sposito
H Meguro
HJW de Baar
HW Rich
IY Fung
J McManus
JA Church
JE Kostka
JF Banfield
JH Martin
JL Hand
JWM Wijsman
K Barbeau
KL Smith
KS Johnson
Liane G Benning
M Chen
Martyn Tranter
ML Wells
N Cassar
N Yee
P Berg
P Boyd
PJ Lam
PN Sedwick
R De'ath
R Edwards
R Raiswell
RA Duce
RCL Wilson
RM Cornell
Rob Raiswell
S Blain
S Shaw
S-M Fan
Slawek Tulaczyk
SW Poulton
TA Scambos
TD Jickells
U Schwertmann
VA Elrod
W Berelson
WS Patterson
Publication venue: BioMed Central
Publication date: 01/01/2008

Productivity in the Southern Oceans is iron-limited, and the supply of iron dissolved from aeolian dust is believed to be the main source from outside the marine reservoir. Glacial sediment sources of iron have rarely been considered, as the iron has been assumed to be inert and non-bioavailable. This study demonstrates the presence of potentially bioavailable Fe as ferrihydrite and goethite in nanoparticulate clusters, in sediments collected from icebergs in the Southern Ocean and glaciers on the Antarctic landmass. Nanoparticles in ice can be transported by icebergs away from coastal regions in the Southern Ocean, enabling melting to release bioavailable Fe to the open ocean. The abundance of nanoparticulate iron has been measured by an ascorbate extraction. These data indicate that the fluxes of bioavailable iron supplied to the Southern Ocean from aeolian dust (0.01–0.13 Tg yr⁻¹) and icebergs (0.06–0.12 Tg yr⁻¹) are comparable. Increases in iceberg production thus have the capacity to increase productivity, and this newly identified negative feedback may help to mitigate fossil fuel emissions.

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Explore Bristol Research

Drug-drug interactions and QT prolongation as a commonly assessed cardiac effect - comprehensive overview of clinical trials

Author: A Hemeryck
A Kalliokoski
A Selzer
A Shehab
AD Haarst van
AJ Camm
AJ Davies
AJ Kaumann
AK Miller
AL Fenrich
AL Kovac
AM Brown
ASY Chain
B Charbit
B Darpo
B Szabo
B Tyl
B Wisniowska
BA Sproule
BM VandenBrink
BP Monahan
BR Brooks
C Banfield
C Funck‐Brentano
C Graff
C Lu
CA Dackis
CB Eap
CB Nemeroff
CC Chu
CT January
CU Correll
D Martin
D Razzouk
D Shaffer
DK Wysowski
DM Roden
DR Abernethy
E Azim
EK Heist
EM Antman
EP Harrigan
EP Pioro
European Medicines Agency
F Dessertenne
F Nosten
F Ponti De
F Simons
G Lefevre
GG Belz
H Karunajeewa
H Laverty
H Sijs van der
H Suessbrich
H Zhou
HM Jones
HR Mellor
HS Panitch
J Alderman
J Demolis
J Hochman
J König
J Manning
J Morganroth
J Schmittner
JA Zix
JC Bouvy
JC Hancox
JD Zeuli
JE Tisdale
JF Carlquist
JH Lee
JL Goren
JP Piccini
JR Baker
JY Park
K Bachmann
K Theisen
K Wenckebach
K Wenzel-Seifert
KA Schoedel
KH Haugaa
KS Lee
KS Lim
L Saarnivaara
LC Wienkers
LH Curtis
LL Moltke von
M Affrime
M Akhtar
M Bindschedler
M Desai
M Hinder
M Robert
M Roy
M Sala
MA Gibbs
MB Lewin
MD Brannan
MJ Boyce
MK Laufer
ML Bruin De
MT Chan
N Ferri
N Valecha
N Yoshida
O Aina
OT Mytton
P Chaikin
P Coyne
P Glue
P Kam De
P Liu
PJ Kam de
PK Honig
PT Sager
Q Zhao
R Kempsford
R Tartini
RA Carr
RA Lefebvre
RF Bergstrom
RK Gupta
RL Woosley
S Bran
S Gupta
S Harris
S Hennessy
S Krudsood
S Olsson
S Pukrittayakamee
SI Omoruyi
SM Huang
T Force
T Katoh
T Kosoglou
T Shepard
TA Collins
TJ Gan
TM Craft
WS Redfern
X Delavenne
Y Kurata
YG Yap
Z Desta
Publication venue: Springer Science and Business Media LLC

Crossref

Glucocorticoids contribute to the heritability of leptin in Scottish adult female twins

Author: Banfield E
Connell JMC
Fraser R
Hillis WS
Ingram M
Swan L
Wallace AM
Publication venue: Wiley
Publication date: 01/07/2004

Enlighten

Specific Variants in the MLH1 Gene Region May Drive DNA Methylation, Loss of Protein Expression, and MSI-H Colorectal Cancer

Author: Aaron Pollett
Alfons Navarro
Andrew D. Paterson
Bharati Bapat
Brent W. Zanke
BW Zanke
CA Eads
Celia M. Greenwood
CM Ribic
CR Boland
D Firth
Darshana Daftary
DJ Weisenberger
Elizabeth Dicks
FL Green
GR Abecasis
H Chen
H Hampel
H. Banfield Younghusband
I Harley
J Li
J Liu
JG Herman
JH Yu
JM Allan
JM Carethers
JN Poynter
John D. Potter
John R. McLaughlin
K Thorsen
KJ Livak
LJ Worrillow
M Cotterchio
M Ilyas
M Mrkonjic
MA Jenkins
ME Beiner
Miralem Mrkonjic
ML Veigl
MO Woods
MP Hitchins
Nicole M. Roslin
NM Lindor
PA Newcomb
Patrick S. Parfrey
Peter W. Laird
Polly A. Newcomb
PT Campbell
R Jover
Roger C. Green
S Raptis
S Winawer
Stavroula Raptis
Stephen N. Thibodeau
Steven Gallinger
Theodore Chiang
Thomas J. Hudson
Vaijayanti V. Pethe
W Yu
WH de Vos tot Nederveen Cappel
WM Grady
WS Samowitz
WS Wang
Publication venue: Public Library of Science (PLoS)
Publication date: 01/10/2010

Background: We previously identified an association between a mismatch repair gene, MLH1, promoter SNP (rs1800734) and microsatellite unstable (MSI-H) colorectal cancers (CRCs) in two samples. The current study expanded on this finding as we explored the genetic basis of DNA methylation in this region of chromosome 3. We hypothesized that specific polymorphisms in the MLH1 gene region predispose it to DNA methylation, resulting in the loss of MLH1 gene expression, mismatch-repair function, and consequently to genome-wide microsatellite instability. Methodology/Principal Findings: We first tested our hypothesis in one sample from Ontario (901 cases, 1,097 controls) and replicated major findings in two additional samples from Newfoundland and Labrador (479 cases, 336 controls) and from Seattle (591 cases, 629 controls). Logistic regression was used to test for association between SNPs in the region of MLH1 and CRC, MSI-H CRC, MLH1 gene expression in CRC, and DNA methylation in CRC. The association between rs1800734 and MSI-H CRCs, previously reported in Ontario and Newfoundland, was replicated in the Seattle sample. Two additional SNPs, in strong linkage disequilibrium with rs1800734, showed strong associations with MLH1 promoter methylation, loss of MLH1 protein, and MSI-H CRC in all three samples. The logistic regression model of MSI-H CRC that included MLH1 promoter methylation status and MLH1 immunohistochemistry status fit most parsimoniously in all three samples combined. When rs1800734 was added to this model, its effect was not statistically significant (P-value = 0.72 vs. 2.3 × 10⁻⁴ when the SNP was examined alone). Conclusions/Significance: The observed association of rs1800734 with MSI-H CRC occurs through its effect on the MLH1 promoter methylation, MLH1 IHC deficiency, or both.

Public Library of Science (PLOS)

Crossref

Memorial University Research Repository

Directory of Open Access Journals

PubMed Central

FigShare