Search CORE

8,838 research outputs found

Sub-structural Niching in Estimation of Distribution Algorithms

Author: Abbass H. A.
Goldberg D. E.
Johnson D. D.
Sastry K.
Publication venue
Publication date: 01/01/2005
Field of study

We propose a sub-structural niching method that fully exploits the problem decomposition capability of linkage-learning methods such as the estimation of distribution algorithms and concentrate on maintaining diversity at the sub-structural level. The proposed method consists of three key components: (1) Problem decomposition and sub-structure identification, (2) sub-structure fitness estimation, and (3) sub-structural niche preservation. The sub-structural niching method is compared to restricted tournament selection (RTS)--a niching method used in hierarchical Bayesian optimization algorithm--with special emphasis on sustained preservation of multiple global solutions of a class of boundedly-difficult, additively-separable multimodal problems. The results show that sub-structural niching successfully maintains multiple global optima over large number of generations and does so with significantly less population than RTS. Additionally, the market share of each of the niche is much closer to the expected level in sub-structural niching when compared to RTS

arXiv.org e-Print Archive

CiteSeerX

Fast Differentially Private Matrix Factorization

Author: Ahn S.
Chen T.
Ding N.
Hartstein A.
Keshavan R.
Kyrola A.
Marsaglia G.
Meka R.
Mir D. J.
Neal R. M.
Niu F.
Sato I.
Srebro N.
Wang Y.-X.
Wang Y.-X.
Welling M.
Xin Y.
Zhao H.
Publication venue
Publication date: 07/05/2015
Field of study

Differentially private collaborative filtering is a challenging task, both in terms of accuracy and speed. We present a simple algorithm that is provably differentially private, while offering good performance, using a novel connection of differential privacy to Bayesian posterior sampling via Stochastic Gradient Langevin Dynamics. Due to its simplicity the algorithm lends itself to efficient implementation. By careful systems design and by exploiting the power law behavior of the data to maximize CPU cache bandwidth we are able to generate 1024 dimensional models at a rate of 8.5 million recommendations per second on a single PC

arXiv.org e-Print Archive

Crossref

Bayesian hierarchical clustering for studying cancer gene expression data with unknown statistics

Author: A Su
B Frey
C Nutt
C Rasmussen
C Rasmussen
D Arango
D Jiang
D Singh
David R. J. Snead
E Cooke
Ferdinando Di Cunto
G Brock
J Ihmels
J Yao
K Yeung
Korsuk Sirinukunwattana
L Hubert
L McQuitty
LF Wu
M De Souto
M Eisen
M Shipp
Muhammad F. Bari
Nasir M. Rajpoot
P D'haeseleer
P Laiho
R Neal
R Savage
R Sokal
Richard S. Savage
S Armstrong
S Datta
S Eschrich
S Falcon
S Matsui
S Pomeroy
S Ramaswamy
S Varambally
T Golub
Y Wang
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2013
Field of study

Clustering analysis is an important tool in studying gene expression data. The Bayesian hierarchical clustering (BHC) algorithm can automatically infer the number of clusters and uses Bayesian model selection to improve clustering quality. In this paper, we present an extension of the BHC algorithm. Our Gaussian BHC (GBHC) algorithm represents data as a mixture of Gaussian distributions. It uses normal-gamma distribution as a conjugate prior on the mean and precision of each of the Gaussian components. We tested GBHC over 11 cancer and 3 synthetic datasets. The results on cancer datasets show that in sample clustering, GBHC on average produces a clustering partition that is more concordant with the ground truth than those obtained from other commonly used algorithms. Furthermore, GBHC frequently infers the number of clusters that is often close to the ground truth. In gene clustering, GBHC also produces a clustering partition that is more biologically plausible than several other state-of-the-art methods. This suggests GBHC as an alternative tool for studying gene expression data. The implementation of GBHC is available at https://sites. google.com/site/gaussianbhc

CiteSeerX

Public Library of Science (PLOS)

Qatar University Institutional Repository

Crossref

Directory of Open Access Journals

PubMed Central

Warwick Research Archives Portal Repository