Search CORE

268 research outputs found

Mixture model with multiple allocations for clustering spatially correlated observations in the analysis of ChIP-Seq data

Author: Banfield
Bao
Battle
Bezdek
Fernández
Fraley
Frühwirth-Schnatter
Green
Kuan
McLachlan
Mo
Nagalakshmi
Pettitt
Raftery
Ramos
Spiegelhalter
Stephens
Thomas
Xing
Zhang
Publication venue
Publication date: 12/05/2016
Field of study

Model-based clustering is a technique widely used to group a collection of units into mutually exclusive groups. There are, however, situations in which an observation could in principle belong to more than one cluster. In the context of Next-Generation Sequencing (NGS) experiments, for example, the signal observed in the data might be produced by two (or more) different biological processes operating together and a gene could participate in both (or all) of them. We propose a novel approach to cluster NGS discrete data, coming from a ChIP-Seq experiment, with a mixture model, allowing each unit to belong potentially to more than one group: these multiple allocation clusters can be flexibly defined via a function combining the features of the original groups without introducing new parameters. The formulation naturally gives rise to a `zero-inflation group' in which values close to zero can be allocated, acting as a correction for the abundance of zeros that manifest in this type of data. We take into account the spatial dependency between observations, which is described through a latent Conditional Auto-Regressive process that can reflect different dependency patterns. We assess the performance of our model within a simulation environment and then we apply it to ChIP-seq real data.Comment: 25 pages; 3 tables, 6 figure

arXiv.org e-Print Archive

Proceedings - University of Groningen

Crossref

University of Groningen

ARTS repository - University of Groningen

Archivio istituzionale della ricerca - Alma Mater Studiorum Università di Bologna

Dissertations of the University of Groningen

Dyadic and mediation analyses of coping with cardiovascular disease

Author: Bartholomew
Bodenmann
Ciechanowski
Cleveland G. Shields
Cook
Coyne
Coyne
DiMatteo
Feeney
Feeney
Fraley
Franks
Kiecolt-Glaser
Luecken
Marcus
Maunder
Nilufer Kafescioglu
Norton
Orth-Gomer
Volker Thomas
Ware
Publication venue: Published by Elsevier Ltd.
Publication date: 01/01/2010
Field of study

AbstractThe purpose of this study was to investigate the relationship between attachment security and health outcomes of cardiac patients and their spouses. Dyadic coping and relationship quality were proposed to mediate this relationship. Participants were 72 couples in which one member of the couple was participating in cardiac rehabilitation. Results showed that participants with higher attachment avoidance perceived their general and mental health worse and were less likely to exercise. Patients with higher attachment avoidance perceived their partner as less supportive and this was negatively associated with their general and mental health. Spouses’ positive support and marital happiness partially mediated the relationship between their attachment anxiety and mental health. Patients with spouses with higher attachment anxiety exercised more; whereas spouses of patients with higher attachment anxiety exercised less

Elsevier - Publisher Connector

Crossref

Dogus University Institutional Repository

Idiopathic pulmonary fibrosis: Prognostic value of changes in physiology and six minute hallwalk.

Author: Andrei Adin-Cristian
Colby Thomas V.
Flaherty Kevin R.
Fraley Chris
Gross Barry H.
Kazerooni Ella A.
Lama Vibha
Martinez Fernando J.
Murray Susan
Toews Galen B.
Travis William D.
Publication venue
Publication date: 01/01/2006
Field of study

Rationale and Hypothesis: Idiopathic pulmonary fibrosis is a fatal disease with a variable rate of progression. We hypothesized that changes in distance walked and quantity of desaturation during a six-minute-walk test (6MWT) would add prognostic information to changes in FVC or diffusing capacity for carbon monoxide. Methods: One hundred ninety-seven patients with idiopathic pulmonary fibrosis were evaluated. Desaturation during the 6MWT was associated with increased mortality even if a threshold of 88% was not reached. Baseline walk distance predicted subsequent walk distance but was not a reliable predictor of subsequent mortality in multivariate survival models. The predictive ability of serial changes in physiology varied when patients were stratified by the presence/absence of desaturation 88% during a baseline 6MWT. For patients with a baseline saturation 88% during a 6MWT, the strongest observed predictor of mortality was serial change in diffusing capacity for carbon monoxide. For patients with saturation 88% during their baseline walk test, serial decreases in FVC and increases in desaturation area significantly predicted subsequent mortality, whereas decreases in walk distance and in diffusing capacity for carbon monoxide displayed less consistent statistical evidence of increasing mortality in our patients. Conclusion: These data highlight the importance of stratifying patients by degree of desaturation during a 6MWT before attributing prognostic value to serial changes in other physiologic variables.Peer Reviewedhttp://deepblue.lib.umich.edu/bitstream/2027.42/91940/1/2006 AJRCCM Idiopathic pulmonary fibrosis - Prognostic value of changes in physiology and six minute hallwalk.pd

Deep Blue Documents

Multimodality in galaxy clusters from SDSS DR8: substructure and velocity distribution

Author: A. Ahvensalmi
Aguerri
Aihara
Andrade-Santos
Anscombe
Araya-Melo
Barrena
Belsole
Berlind
Berrier
Bird
Biviano
Blanton
Blanton
Blanton
Boschin
Boschin
Burgett
Böhringer
Coziol
de Lapparent
Donnelly
Dressler
Dressler
Durret
D’Agostino
E. Saar
E. Tago
E. Tempel
Einasto
Einasto
Einasto
Einasto
Einasto
Einasto
Fakhouri
Flin
Fraley
Gal
Gastaldello
Giocoli
Gregory
Holopainen
Hou
Huchra
Huertas-Company
Hwang
Hwang
Ihaka
J. Einasto
J. Vennik
Jeeson-Daniel
Jõeveer
Knebe
Kolokotronis
L. J. Liivamägi
Loeb
M. Einasto
Malumuth
Martínez
McIntosh
Merritt
Merritt
Mo
Niemi
Oegerle
Ostriker
Owers
Owers
P. Heinämäki
P. Nurmi
Piffaretti
Pimbblet
Pinkney
Ramella
Ribeiro
Richstone
Serna
Shapiro
Skibba
Skibba
Smith
Solanes
Swinbank
Tago
Tago
Tempel
Tempel
Tempel
Thomas
Tovmassian
Turner
V. J. Martínez
van den Bosch
Vennik
West
West
White
White
White
Yang
Zeldovich
Publication venue: 'EDP Sciences'
Publication date: 22/02/2012
Field of study

We search for the presence of substructure, a non-Gaussian, asymmetrical velocity distribution of galaxies, and large peculiar velocities of the main galaxies in galaxy clusters with at least 50 member galaxies, drawn from the SDSS DR8. We employ a number of 3D, 2D, and 1D tests to analyse the distribution of galaxies in clusters: 3D normal mixture modelling, the Dressler-Shectman test, the Anderson-Darling and Shapiro-Wilk tests and others. We find the peculiar velocities of the main galaxies, and use principal component analysis to characterise our results. More than 80% of the clusters in our sample have substructure according to 3D normal mixture modelling, the Dressler-Shectman (DS) test shows substructure in about 70% of the clusters. The median value of the peculiar velocities of the main galaxies in clusters is 206 km/s (41% of the rms velocity). The velocities of galaxies in more than 20% of the clusters show significant non-Gaussianity. While multidimensional normal mixture modelling is more sensitive than the DS test in resolving substructure in the sky distribution of cluster galaxies, the DS test determines better substructure expressed as tails in the velocity distribution of galaxies. Richer, larger, and more luminous clusters have larger amount of substructure and larger (compared to the rms velocity) peculiar velocities of the main galaxies. Principal component analysis of both the substructure indicators and the physical parameters of clusters shows that galaxy clusters are complicated objects, the properties of which cannot be explained with a small number of parameters or delimited by one single test. The presence of substructure, the non-Gaussian velocity distributions, as well as the large peculiar velocities of the main galaxies, shows that most of the clusters in our sample are dynamically young.Comment: 15 pages, 11 figures, 2 online tables, accepted for publication in Astronomy and Astrophysic

arXiv.org e-Print Archive

Crossref

EDP Sciences OAI-PMH repository (1.2.0)

Development and Validation of the Behavioral Tendencies Questionnaire

Author: A Brown
A Brown
A Brown
A Maydeu-Olivares
A Thomas
Anna Brown
B Buddhaghosa
CG DeYoung
CG DeYoung
CG DeYoung
CK Conners
CK Haddock
CR Cloninger
CS Carver
D Bartram
D Frawley
DE Evans
DE Evans
DJ Jackson
DM Amodio
DM Buss
DP McAdams
H van Herk
IA Smits
IL Janis
J Kabat-Zinn
J Kornfield
J Specht
JA Gray
Jake H. Davis
JC Norcross
JM Digman
JR Gray
JS Adelstein
Judson A. Brewer
K Busia
K Casler
KL Nylund
L Campbell-Sills
L Mann
LK Muthen
LL Couch
M Buhrmester
Martin Voracek
MD Ainsworth
MF Scheier
MWL Cheung
Nicholas T. Van Dam
OP John
PJ Corr
R Gethin
R Mann
RC Fraley
RH Moos
RH Moos
Tom B. Mole
W Chan
W Chan
Willoughby B. Britton
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 04/11/2015
Field of study

At a fundamental level, taxonomy of behavior and behavioral tendencies can be described in terms of approach, avoid, or equivocate (i.e., neither approach nor avoid). While there are numerous theories of personality, temperament, and character, few seem to take advantage of parsimonious taxonomy. The present study sought to implement this taxonomy by creating a questionnaire based on a categorization of behavioral temperaments/tendencies first identified in Buddhist accounts over fifteen hundred years ago. Items were developed using historical and contemporary texts of the behavioral temperaments, described as “Greedy/Faithful”, “Aversive/Discerning”, and “Deluded/Speculative”. To both maintain this categorical typology and benefit from the advantageous properties of forced-choice response format (e.g., reduction of response biases), binary pairwise preferences for items were modeled using Latent Class Analysis (LCA). One sample (n1 = 394) was used to estimate the item parameters, and the second sample (n2 = 504) was used to classify the participants using the established parameters and cross-validate the classification against multiple other measures. The cross-validated measure exhibited good nomothetic span (construct-consistent relationships with related measures) that seemed to corroborate the ideas present in the original Buddhist source documents. The final 13-block questionnaire created from the best performing items (the Behavioral Tendencies Questionnaire or BTQ) is a psychometrically valid questionnaire that is historically consistent, based in behavioral tendencies, and promises practical and clinical utility particularly in settings that teach and study meditation practices such as Mindfulness Based Stress Reduction (MBSR)

Public Library of Science (PLOS)

City University of New York

Crossref

Directory of Open Access Journals

PubMed Central

eScholarship@UMMS

Kent Academic Repository

University of Melbourne Institutional Repository

Orientation-dependent backbone-only residue pair scoring functions for fixed backbone protein design

Author: A Senes
Andrew J Bordner
AP Dempster
B Kuhlman
B Rost
B Rost
B Rost
BW Silverman
C Chothia
C Fraley
C Fraley
C Sander
CA Rohl
D Tobi
DW Scott
EP Baldwin
G Schwartz
GE Tusnady
GJ McLachlan
I Bahar
J Kleinberg
J Kopp
J Pearl
JE Fitzgerald
JF Gibrat
JP Kocher
JS Yedidia
JU Bowie
K Mizuguchi
K Onizuka
L Kaufman
LA Mirny
M Lu
MJ Sippl
NV Buchete
NV Buchete
PD Thomas
R Bellman
R Development Core Team
R Rajgaria
R Rajgaria
R Samudrala
RA Abagyan
RC Edgar
S Miyazawa
S Miyazawa
S Miyazawa
S Miyazawa
SB Nabuurs
SF Altschul
U Hobohm
W Kabsh
W Li
Y Weiss
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Abstract Background Empirical scoring functions have proven useful in protein structure modeling. Most such scoring functions depend on protein side chain conformations. However, backbone-only scoring functions do not require computationally intensive structure optimization and so are well suited to protein design, which requires fast score evaluation. Furthermore, scoring functions that account for the distinctive relative position and orientation preferences of residue pairs are expected to be more accurate than those that depend only on the separation distance. Results Residue pair scoring functions for fixed backbone protein design were derived using only backbone geometry. Unlike previous studies that used spherical harmonics to fit 2D angular distributions, Gaussian Mixture Models were used to fit the full 3D (position only) and 6D (position and orientation) distributions of residue pairs. The performance of the 1D (residue separation only), 3D, and 6D scoring functions were compared by their ability to identify correct threading solutions for a non-redundant benchmark set of protein backbone structures. The threading accuracy was found to steadily increase with increasing dimension, with the 6D scoring function achieving the highest accuracy. Furthermore, the 3D and 6D scoring functions were shown to outperform side chain-dependent empirical potentials from three other studies. Next, two computational methods that take advantage of the speed and pairwise form of these new backbone-only scoring functions were investigated. The first is a procedure that exploits available sequence data by averaging scores over threading solutions for homologs. This was evaluated by applying it to the challenging problem of identifying interacting transmembrane alpha-helices and found to further improve prediction accuracy. The second is a protein design method for determining the optimal sequence for a backbone structure by applying Belief Propagation optimization using the 6D scoring functions. The sensitivity of this method to backbone structure perturbations was compared with that of fixed-backbone all-atom modeling by determining the similarities between optimal sequences for two different backbone structures within the same protein family. The results showed that the design method using 6D scoring functions was more robust to small variations in backbone structure than the all-atom design method. Conclusions Backbone-only residue pair scoring functions that account for all six relative degrees of freedom are the most accurate and including the scores of homologs further improves the accuracy in threading applications. The 6D scoring function outperformed several side chain-dependent potentials while avoiding time-consuming and error prone side chain structure prediction. These scoring functions are particularly useful as an initial filter in protein design problems before applying all-atom modeling.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Tomato: a crop species amenable to improvement by cellular and molecular methods

Author: A. Crossway
A. Guri
A. Hoekema
A. Kinshara
A. Morgan
A. Zelcer
A.H. Paterson
A.J. Framond De
A.L. Phillips
A.L. Phillips
A.R. Krol Van der
B. Baker
B. Baker
B. Hause
B. McClintock
B. Piechulla
B.A. Levenko
B.D. Harrison
B.L. Ward
B.P. Moens
B.R. Thomas
B.R. Thomas
B.S. Gill
B.S. Landry
B.S. Landry
B.S. Landry
C. Chang
C. Pieterse
C. Waldron
C.D. Riggs
C.E. Vallejos
C.E. Vallejos
C.J.S. Smith
C.M. Rick
C.M. Rick
C.M. Rick
C.M. Rick
C.M. Rick
C.M. Rick
C.M. Rick
C.M. Rick
C.M. Smith
C.P. Meredith
D. Boer De
D. Botstein
D. Shah
D. Zamir
D. Zamir
D. Zamir
D.A. Evans
D.A. Evans
D.M. Barker
D.M. Lonsdale
D.P. Barlow
D.W. Barton
D.W. Galbraith
E. Haute Van
E. Pichersky
E.A. Frankenberger
E.A. Frankenberger
E.A. Shahin
E.A. Shahin
E.C. Tigchelaar
F.J. Zapata
G. Gavazzi
G. Melchers
G. Schweizer
G.S. Khush
G.S. Khush
H.P. Mühlbach
I. Negrutiu
I.M. Greenblat
J. Botterman
J. Haseloff
J. Hille
J. Hille
J. Paszkowski
J. Phillips
J. Wijbrandi
J.D. Palmer
J.D. Palmer
J.D. Palmer
J.D. Palmer
J.F. Gusella
J.F. Gusella
J.F. Shepard
J.G. Atherton
J.H. Hawkes
J.I. Yoder
J.J. Fillatti
J.L. Bennetzen
J.M. van Montagu Schell
J.S. Beckmann
J.S. Beckmann
J.W. Maxon-Smith
J.W. Verna De
Jacoues Hille
K. Ohyama
K. Shinozaki
K. Umesono
K.A. Darden
K.K. Kartha
L. Herrera-Estrella
L. Herrera-Estrella
L. Marton
L.A. Miki
L.P. Hosticka
L.W. Handley
M. Block De
M. Buiatti
M. Buiatti
M. Cuozzo
M. Jongsma
M. Koornneef
M. Koornneef
M. Mutschler
M. S. Ramanna
M. Sibi
M. Soller
M. Vaeek
M. Wallroth
M. Ziv
M.A. O'Connell
M.A. O'Connell
M.A. O'Connell
M.A. O'Connell
M.A. Sluys Van
M.A. Stevens
M.A. Stevens
M.D. Bennett
M.L.M.C. Tan
M.L.M.C. Tan
M.R. Hanson
M.S. Ramanna
M.S. Ramanna
M.S. Ramanna
M.W. Bevan
M.W. Ganal
M.Y. Menzel
Maarten Koornneef
N.A. Zagorska
N.D. Young
N.D. Young
N.E. Tumer
N.V. Federoff
P. Little
P. Zambryski
P.E. McClean
P.F. Ambros
P.G. Smith
P.J.M. Elzen Van den
P.J.M. Elzen Van den
P.M. Gresshoff
Pim Zabel
R. Bernatzky
R. Bernatzky
R. Ecochard
R. Hain
R. Niedz
R.A. Bressan
R.A.J. Daelen Van
R.B. Flavell
R.B. Horsch
R.D. Locky
R.D. Shillito
R.S. Nelson
R.T. Fraley
S. Dellaporta
S. McCormick
S. Ohki
S.D. Tanksley
S.D. Tanksley
S.D. Tanksley
S.D. Tanksley
S.D. Tanksley
S.D. Tanksley
St.G. Rogers
T. Helentjaris
T. Helentjaris
T. Helentjaris
T.C. Osborn
T.L. Adams
T.M. Klein
U. Wienand
V. Padmanabhan
V.A. Hilder
W. Gottschalk
W. Gruissem
W.L. Gerlach
W.R. Sharp
Y.S. Chyi
Z. Koukolikova-Nicola
Z. Tabaeizadeh
Publication venue
Publication date: 01/01/1989
Field of study

Tomato is a crop plant with a relatively small DNA content per haploid genome and a well developed genetics. Plant regeneration from explants and protoplasts is feasable which led to the development of efficient transformation procedures. In view of the current data, the isolation of useful mutants at the cellular level probably will be of limited value in the genetic improvement of tomato. Protoplast fusion may lead to novel combinations of organelle and nuclear DNA (cybrids), whereas this technique also provides a means of introducing genetic information from alien species into tomato. Important developments have come from molecular approaches. Following the construction of an RFLP map, these RFLP markers can be used in tomato to tag quantitative traits bred in from related species. Both RFLP's and transposons are in the process of being used to clone desired genes for which no gene products are known. Cloned genes can be introduced and potentially improve specific properties of tomato especially those controlled by single genes. Recent results suggest that, in principle, phenotypic mutants can be created for cloned and characterized genes and will prove their value in further improving the cultivated tomato.

Crossref

Proceedings - University of Groningen

University of Groningen

ARTS repository - University of Groningen

Wageningen University & Research Publications

University of Groningen Digital Archive

Dissertations of the University of Groningen

Dynamics of the NGC 4636 globular cluster system II. Improved constraints from a large sample of globular cluster velocities

Author: Annibali
Avni
B. Dirsch
Baldi
Bender
Bergond
Binggeli
Binney
Bruzual
Bullock
Burkert
Burstein
Calamida
Cappellari
Cappellari
Chakrabarty
Chakrabarty
Côté
Dirsch
Dirsch
Donato
Evans
Famaey
Forman
Fraley
Gentile
Gerhard
Hilker
Jensen
Johnson
Jones
Kissler
Kronawitter
Kroupa
Lee
Loewenstein
Loewenstein
M. Hilker
Magorrian
Mamon
Maraston
Matsushita
McGaugh
Merritt
Milgrom
Milgrom
Navarro
Park
Percival
Proctor
Prugniel
Pryor
Pu
R. Salinas
Richtler
Richtler
S. S. Larsen
Sanders
Schlegel
Schuberth
Schuberth
Schuberth
Schwarz
Stark
Stephens
Stetson
Strader
T. Richtler
Tal
Temi
Temi
Thomas
Thomas
Tonry
Tonry
Trachternach
van de Ven
van der Marel
Vazdekis
Vazdekis
Weidner
Weijmans
Woodley
Y. Schuberth
Publication venue: 'EDP Sciences'
Publication date: 01/01/2012
Field of study

We present new radial velocities for 289 globular clusters around NGC 4636, the southernmost giant elliptical galaxy of the Virgo cluster. The data were obtained with FORS2/MXU at the Very Large Telescope. Together with data analysed in an earlier study (Schuberth et al. 2006), we now have a sample of 460 globular cluster velocities out to a radius of 12 arcmin (60 kpc) available - one of the largest of its kind. This new data set also provides a much more complete angular coverage. Moreover, we present new kinematical data of the inner stellar population of NGC 4636. We perform an updated Jeans analysis, using both stellar and GC data, to better constrain the dark halo properties. We find a stellar M/L-ratio of 5.8 in the R-band, higher than expected from single stellar population synthesis. We model the dark halo by cored and cuspy analytical halo profiles and consider different anisotropies for the tracer populations. Properties of NFW halos lie well within the expected range of cosmological simulations. Cored halos give central dark matter densities, which are typical for elliptical galaxies of NGC 4636's luminosity. The surface densities of the dark matter halos are higher than those of spiral galaxies. We compare the predictions of Modified Newtonian Dynamics with the derived halo properties and find satisfactory agreement. Therefore NGC 4636 therefore falls onto the baryonic Tully-Fisher relation for spiral galaxies. The comparison with the X-ray mass profile of Johnson et al. (2009) reveals satisfactory agreement only, if the abundance gradient of hot plasma has been taken into account. This might indicate a general bias towards higher masses for X-ray based mass profiles in all systems, including galaxy clusters, with strong abundance gradients.Comment: 47 pages, 24 figures. Accepted for publication in Astronomy & Astrophysic

arXiv.org e-Print Archive

Crossref

EDP Sciences OAI-PMH repository (1.2.0)

Radboud Repository (Radboud Univ.)

Neural Crest Cell Survival Is Dependent on Rho Kinase and Is Required for Development of the Mid Face in Mouse Embryos

Author: A Beverdam
A Carisey
AG Porter
B Fogelgren
Bill Chaudhry
BM McBratney
C Carmona-Fontaine
D Thumkeo
D Thumkeo
Deborah J. Henderson
DR Cordero
F Wiradjaja
GN Serbedzija
Helen M. Phillips
Helena Soenen
K Iwao
K Kobayashi
L Wei
M Amano
M Amano
M Amano
M Fukata
M Groysman
M Minoux
M Raftopoulou
MD Abramoff
P Soriano
Patricia Ybot-Gonzalez
Patrick Callaerts
PS Danielian
PS Thomas
RD Knight
S Qu
S Srinivas
SI Fraley
SR Twigg
Tania Papoutsi
V Hildreth
VE Franklin-Tong
W Ma
WP Daley
Y Lallemand
Y Shimizu
Publication venue: Public Library of Science
Publication date: 01/01/2012
Field of study

Neural crest cells (NCC) give rise to much of the tissue that forms the vertebrate head and face, including cartilage and bone, cranial ganglia and teeth. In this study we show that conditional expression of a dominant-negative (DN) form of Rho kinase (Rock) in mouse NCC results in severe hypoplasia of the frontonasal processes and first pharyngeal arch, ultimately resulting in reduction of the maxilla and nasal bones and severe craniofacial clefting affecting the nose, palate and lip. These defects resemble frontonasal dysplasia in humans. Disruption of the actin cytoskeleton, which leads to abnormalities in cell-matrix attachment, is seen in the RockDN;Wnt1-cre mutant embryos. This leads to elevated cell death, resulting in NCC deficiency and hypoplastic NCC-derived craniofacial structures. Rock is thus essential for survival of NCC that form the craniofacial region. We propose that reduced NCC numbers in the frontonasal processes and first pharyngeal arch, resulting from exacerbated cell death, may be the common mechanism underlying frontonasal dysplasia

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Digital.CSIC

Fondo Bibliográfico Digital Institucional

The Francis Crick Institute

Mixture of latent trait analyzers for model-based clustering of categorical data

Author: A. Frank
A. Hadgu
A.E. Raftery
A.P. Dempster
B. Efron
B. Muthén
C. Biernacki
C. Fraley
C.M. Bishop
D. Karlis
D.J. Bartholomew
D.J. Bartholomew
D.J. Bartholomew
D.K. Pauler
E.A. Erosheva
E.A. Erosheva
E.A. Erosheva
E.S. Allman
G. Celeux
G. McLachlan
G. McLachlan
G. McLachlan
G. Rasch
G. Schwarz
Isabella Gollini
J. Baek
J. Rost
J. Rost
J. Vermunt
J. Vermunt
J. Vermunt
J.L. Andrews
J.S. Uebersax
L.A. Goodman
M. Abramowitz
M. Davier von
M. Davier von
M.D. Sammel
M.E. Tipping
N. Dean
P.D. McNicholas
P.D. McNicholas
R.D. Bock
S. Brin
S.E. Fienberg
T.I. Lin
T.I. Lin
T.S. Jaakkola
Thomas Brendan Murphy
Y. Qu
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 19/02/2013
Field of study

Model-based clustering methods for continuous data are well established and commonly used in a wide range of applications. However, model-based clustering methods for categorical data are less standard. Latent class analysis is a commonly used method for model-based clustering of binary data and/or categorical data, but due to an assumed local independence structure there may not be a correspondence between the estimated latent classes and groups in the population of interest. The mixture of latent trait analyzers model extends latent class analysis by assuming a model for the categorical response variables that depends on both a categorical latent class and a continuous latent trait variable; the discrete latent class accommodates group structure and the continuous latent trait accommodates dependence within these groups. Fitting the mixture of latent trait analyzers model is potentially difficult because the likelihood function involves an integral that cannot be evaluated analytically. We develop a variational approach for fitting the mixture of latent trait models and this provides an efficient model fitting strategy. The mixture of latent trait analyzers model is demonstrated on the analysis of data from the National Long Term Care Survey (NLTCS) and voting in the U.S. Congress. The model is shown to yield intuitive clustering results and it gives a much better fit than either latent class analysis or latent trait analysis alone

arXiv.org e-Print Archive

Crossref

Birkbeck Institutional Research Online

Explore Bristol Research