
    Recognizing Treelike k-Dissimilarities

    A k-dissimilarity D on a finite set X, |X| >= k, is a map from the set of size-k subsets of X to the real numbers. Such maps naturally arise from edge-weighted trees T with leaf-set X: given a subset Y of X of size k, D(Y) is defined to be the total length of the smallest subtree of T with leaf-set Y. In case k = 2, it is well known that 2-dissimilarities arising in this way can be characterized by the so-called "4-point condition". However, in case k > 2, Pachter and Speyer recently posed the following question: given an arbitrary k-dissimilarity, how do we test whether this map comes from a tree? In this paper, we provide an answer to this question, showing that for k >= 3 a k-dissimilarity on a set X arises from a tree if and only if its restriction to every 2k-element subset of X arises from some tree, and that 2k is the least possible subset size to ensure that this is the case. As a corollary, we show that there exists a polynomial-time algorithm to determine when a k-dissimilarity arises from a tree. We also give a 6-point condition for determining when a 3-dissimilarity arises from a tree, similar to the aforementioned 4-point condition.
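
    As an aside on the k = 2 case: the 4-point condition can be checked by brute force over all quadruples. Below is a minimal Python sketch, assuming the dissimilarity is stored as a dict keyed by two-element frozensets (a representation chosen here for illustration, not taken from the paper). A tree metric requires that, for every quadruple, the two largest of the three pairwise sums coincide.

    from itertools import combinations

    def is_tree_metric(D, X, tol=1e-9):
        # D: dict mapping frozenset({x, y}) to a float; X: sequence of elements.
        d = lambda a, b: D[frozenset((a, b))]
        for x, y, z, w in combinations(X, 4):
            # The three pairwise sums over the quadruple.
            s = sorted([d(x, y) + d(z, w),
                        d(x, z) + d(y, w),
                        d(x, w) + d(y, z)])
            # 4-point condition: the two largest sums must coincide.
            if s[2] - s[1] > tol:
                return False
        return True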

    A flexible framework for sparse simultaneous component based data integration

    Background: High-throughput data are complex, and methods that reveal the structure underlying the data are most useful. Principal component analysis, frequently implemented as a singular value decomposition, is a popular technique in this respect. Nowadays the challenge is often to reveal structure in several sources of information (e.g., transcriptomics, proteomics) that are available for the same biological entities under study. Simultaneous component methods are most promising in this respect. However, the interpretation of the principal and simultaneous components is often daunting because the contributions of each of the biomolecules (transcripts, proteins) have to be taken into account.

    Results: We propose a sparse simultaneous component method that makes many of the parameters redundant by shrinking them to zero. It includes principal component analysis, sparse principal component analysis, and ordinary simultaneous component analysis as special cases. Several penalties can be tuned that account in different ways for the block structure present in the integrated data. This yields known sparse approaches such as the lasso, the ridge penalty, the elastic net, the group lasso, the sparse group lasso, and the elitist lasso. In addition, the algorithmic results can easily be transposed to the context of regression. Metabolomics data obtained with two measurement platforms for the same set of Escherichia coli samples are used to illustrate the proposed methodology and the properties of different penalties with respect to sparseness across and within data blocks.

    Conclusion: Sparse simultaneous component analysis is a useful method for data integration: first, simultaneous analyses of multiple blocks offer advantages over sequential and separate analyses, and second, interpretation of the results is greatly facilitated by their sparseness. The approach offered is flexible and allows the block structure to be taken into account in different ways. As such, structures can be found that are exclusively tied to one data platform (group lasso approach) as well as structures that involve all data platforms (elitist lasso approach).

    Availability: The additional file contains a MATLAB implementation of the sparse simultaneous component method.
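
    As a rough illustration of how such a method can be set up, the sketch below alternates a scores update with a soft-thresholded loadings update on the concatenated data blocks. It implements a plain lasso penalty only; the function names and the initialization are assumptions made here, and this is not the authors' MATLAB implementation.

    import numpy as np

    def soft_threshold(A, lam):
        # Elementwise lasso operator: shrink entries toward zero by lam.
        return np.sign(A) * np.maximum(np.abs(A) - lam, 0.0)

    def sparse_sca(blocks, n_comp=2, lam=0.1, n_iter=200, seed=0):
        # blocks: list of (n_samples, n_vars_k) arrays sharing the same rows.
        X = np.hstack(blocks)                     # concatenated data
        rng = np.random.default_rng(seed)
        T, _ = np.linalg.qr(rng.standard_normal((X.shape[0], n_comp)))
        for _ in range(n_iter):
            P = soft_threshold(X.T @ T, lam)      # sparse loadings update
            U, _, Vt = np.linalg.svd(X @ P, full_matrices=False)
            T = U @ Vt                            # orthogonal Procrustes scores
        return T, P

    A group lasso variant would instead shrink entire blockwise rows of P via their Euclidean norms, producing sparseness across rather than within data blocks.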

    MTV and MGV: Two Criteria for Nonlinear PCA

    MTV (Maximum Total Variance) and MGV (Minimum Generalized Variance) are popular criteria for PCA with optimal scaling. They are adopted by the SAS PRINQUAL procedure and OSMOD (Saito and Otsu, 1988). MTV is an intuitive generalization of the linear PCA criterion. We will show some properties of nonlinear PCA with these criteria in an application to the data of NLSY79 (Zagorsky, 1997), a large panel survey in the U.S. conducted over twenty years. We will show the following: (1) the effectiveness of PCA with optimal scaling as a tool for the analysis of large social research data; useful results can be obtained when it complements analyses by regression models; (2) features of MTV and MGV, especially their abilities and deficiencies in real data analysis.
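
    For intuition about the MTV criterion, here is a toy alternating scheme: variables are quantified numerically, a rank-r PCA fit is computed, and each category's quantification is replaced by the mean fitted value over that category. This is a simplified nominal-scaling sketch under assumptions made here; it is not the PRINQUAL or OSMOD algorithm.

    import numpy as np

    def mtv_pca(codes, n_comp=2, n_iter=50):
        # codes: (n, m) integer array of category codes, one column per variable.
        Q = codes.astype(float)                        # initial quantifications
        for _ in range(n_iter):
            Z = (Q - Q.mean(0)) / (Q.std(0) + 1e-12)   # standardize columns
            U, s, Vt = np.linalg.svd(Z, full_matrices=False)
            Zhat = (U[:, :n_comp] * s[:n_comp]) @ Vt[:n_comp]  # rank-r PCA fit
            for j in range(codes.shape[1]):            # rescoring step
                for c in np.unique(codes[:, j]):
                    mask = codes[:, j] == c
                    Q[mask, j] = Zhat[mask, j].mean()  # category mean of the fit
        return (Q - Q.mean(0)) / (Q.std(0) + 1e-12)    # final quantified matrix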

    Interpreting degenerate solutions in unfolding by use of the vector model and the compensatory distance model

    In this paper, we reconsider the merits of unfolding solutions based on loss functions involving a normalization on the variance per subject. In the literature, solutions based on Stress-2 are often diagnosed to be degenerate in the majority of cases. Here, the focus lies on two frequently occurring types of degeneracies. The first type typically locates some subject points far away from a compact cluster of the other points. In the second type of solution, the object points lie on a circle. In this paper, we argue that these degenerate solutions are well fitting and informative. To reveal the information, we introduce mixtures of plots based on the ideal-point model of unfolding, the vector model, and the signed distance model. In addition to a different representation, we provide a new iterative majorization algorithm to optimize the average squared correlation between the distances in the configuration and the transformed data per individual. It is shown that this approach is equivalent to minimizing Kruskal’s Stress-2.
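
    For reference, Stress-2 normalizes each subject's squared residuals by the variance of that subject's fitted distances. A minimal sketch of this per-subject quantity, assuming subject and object coordinates are given as matrices and the data are already transformed:

    import numpy as np

    def stress2_per_subject(delta, X, Y):
        # delta: (n_subj, n_obj) transformed preference data,
        # X: (n_subj, p) subject points, Y: (n_obj, p) object points.
        D = np.linalg.norm(X[:, None, :] - Y[None, :, :], axis=2)
        num = ((delta - D) ** 2).sum(axis=1)
        # Per-subject normalization by the variance of the fitted distances;
        # this is the term the two degeneracies interact with.
        den = ((D - D.mean(axis=1, keepdims=True)) ** 2).sum(axis=1)
        return np.sqrt(num / den)

    Minimizing the average of these values is what the abstract reports to be equivalent to maximizing the average squared correlation per individual.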
