Search CORE

28 research outputs found

A general asymptotic scheme for inference under order restrictions

Author: Anevski D.
Hössjer O.
Publication venue: 'Institute of Mathematical Statistics'
Publication date: 01/01/2006
Field of study

Limit distributions for the greatest convex minorant and its derivative are considered for a general class of stochastic processes including partial sum processes and empirical processes, for independent, weakly dependent and long range dependent data. The results are applied to isotonic regression, isotonic regression after kernel smoothing, estimation of convex regression functions, and estimation of monotone and convex density functions. Various pointwise limit distributions are obtained, and the rate of convergence depends on the self similarity properties and on the rate of convergence of the processes considered.Comment: Published at http://dx.doi.org/10.1214/009053606000000443 in the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org

arXiv.org e-Print Archive

Crossref

Lund University Publications

Chalmers Research

Chalmers Publication Library

Generalizing univariate signed rank statistics for testing and estimating a multivariate location parameter.

Author: Croux C.
Hössjer O.
Publication venue
Publication date
Field of study

We generalize signed rank statistics to dimensions higher than one. This results in a class of orthogonally invariant and distribution free tests that can be used for testing spherical symmetry/location parameter. The corresponding estimator is orthogonally equivariant. Both the test and estimator can be chosen with asymptotic efficiency 1. The breakdown point of the estimator depends only on the scores, not on the dimension of the data. For elliptical distributions, we obtain an affine invariant test with the same asymptotic properties, if the signed rank statistic is applied to standardized data. We also present a method for computing the estimator numerically, and consider a real data example and some simulations. Finally, an application to detection of time-varying signals in spherically symmetric noise is given.Affine invariant tests; Asymptotic normality; Breakdown point; distribution free tests;

Research Papers in Economics

Generalized S-estimators.

Author: Croux Christophe
Hössjer O.
Rousseeuw Peter
Publication venue
Publication date
Field of study

In this paper we introduce a new type of positive-breakdown regression method, called a generalized S-estimator (or GS-estimator), based on the minimization of a generalized M-estimator of residual scale. We compare the class of GS-estimators with the usual S-estimators, including least median of squares. It turns out that GS-estimators attain a much higher efficiency than S-estimators, at the cost of a slightly increased worst-case bias. We investigate the breakdown point, the maxbias curve and the influence function of GS-estimators. We also give an algorithm for computing GS-estimators, and apply it to real and simulated data.Breakdown point; Influence function; Maxbias curve; Regression analysis; Robustness;

Research Papers in Economics

A Fast Algorithm for Robust Regression with Penalised Trimmed Squares

Author: A Giloni
AC Atkinson
AC Atkinson
AS Hadi
C Agostinelli
CW Coakley
D Gervini
D Peña
D Peña
DM Hawkins
DM Hawkins
DM Hawkins
DM Sebert
G Zioutas
G Zioutas
G. Zioutas
J Agulló
JF Gentleman
L. Pitsoulis
LM Li
LS Pitsoulis
M Salibian-Barrera
MS Bazaraa
N Billor
N Billor
N Billor
O Hössjer
PJ Rousseeuw
PJ Rousseeuw
PJ Rousseeuw
PJ Rousseeuw
RJ Rousseeuw
TA Feo
VJ Yohai
VJ Yohai
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2009
Field of study

The presence of groups containing high leverage outliers makes linear regression a difficult problem due to the masking effect. The available high breakdown estimators based on Least Trimmed Squares often do not succeed in detecting masked high leverage outliers in finite samples. An alternative to the LTS estimator, called Penalised Trimmed Squares (PTS) estimator, was introduced by the authors in \cite{ZiouAv:05,ZiAvPi:07} and it appears to be less sensitive to the masking problem. This estimator is defined by a Quadratic Mixed Integer Programming (QMIP) problem, where in the objective function a penalty cost for each observation is included which serves as an upper bound on the residual error for any feasible regression line. Since the PTS does not require presetting the number of outliers to delete from the data set, it has better efficiency with respect to other estimators. However, due to the high computational complexity of the resulting QMIP problem, exact solutions for moderately large regression problems is infeasible. In this paper we further establish the theoretical properties of the PTS estimator, such as high breakdown and efficiency, and propose an approximate algorithm called Fast-PTS to compute the PTS estimator for large data sets efficiently. Extensive computational experiments on sets of benchmark instances with varying degrees of outlier contamination, indicate that the proposed algorithm performs well in identifying groups of high leverage outliers in reasonable computational time.Comment: 27 page

arXiv.org e-Print Archive

CiteSeerX

Crossref

Statistical quality assessment and outlier detection for liquid chromatography-mass spectrometry experiments

Author: A Fraser
A Prakash
AI Nesvizhskii
BM Mayr
C Croux
CS Brown
DA Stead
E Machtejevas
Egidijus Machtejevas
F Model
GV Cohen Freue
Hartmut Schlüter
J Harezlak
J Listgarten
Joachim Thiemann
K Choo
K Flikka
K Pearson
KC Leptos
Klaus Unger
Knut Reinert
KR Coombes
M Bern
M Mann
M Sturm
M Xu
O Hössjer
O Kohlbacher
O Schulz-Trieglaff
O Schulz-Trieglaff
Ole Schulz-Trieglaff
P Mahalanobis
RE Moore
S Cappadona
S Na
T Whistler
W Windig
Publication venue: BioMed Central
Publication date: 01/01/2008
Field of study

Abstract Background Quality assessment methods, that are common place in engineering and industrial production, are not widely spread in large-scale proteomics experiments. But modern technologies such as Multi-Dimensional Liquid Chromatography coupled to Mass Spectrometry (LC-MS) produce large quantities of proteomic data. These data are prone to measurement errors and reproducibility problems such that an automatic quality assessment and control become increasingly important. Results We propose a methodology to assess the quality and reproducibility of data generated in quantitative LC-MS experiments. We introduce quality descriptors that capture different aspects of the quality and reproducibility of LC-MS data sets. Our method is based on the Mahalanobis distance and a robust Principal Component Analysis. Conclusion We evaluate our approach on several data sets of different complexities and show that we are able to precisely detect LC-MS runs of poor signal quality in large-scale studies.</p

Crossref

Directory of Open Access Journals

Repository: Freie Universität Berlin (FU), Math Department (fu_mi_publications)

PubMed Central

Temporal Dynamics of Host Molecular Responses Differentiate Symptomatic and Asymptomatic Influenza A Infection

Author: A Hero
A Ma
A Ryo
A Sabbah
A Subramanian
A Takaoka
Aimee K. Zaas
AK Zaas
Alfred O. Hero
AP Manderson
Arvind Rao
B Barrett
B Efron
BD Korman
BM Bolstad
Bradley Nicholson
C Abraham
C Cilloniz
C Cilloniz
C Palmer
CE Samuel
CE Samuel
Christopher W. Woods
CM Pombo
CV Rothlin
D Proud
DB Stetson
DC Kang
E Durand
F Carrat
F Martinon
F Martinon
FS Machado
G Chen
Geoffrey S. Ginsburg
GG Jackson
H Hemmi
H Yasukawa
IC Allen
J Andrejeva
J Faraway
J Pothlichet
Jay B. Varkey
JC Castelli
JC Sun
JD Storey
JE Fenner
JJ Goeman
JP Hugot
K Honda
KB Schwarz
KS Kobayashi
KS Schluns
L Pulliam
LA Joosten
Lawrence Carin
M Vandermeer
M Yamamoto
M Yoneyama
MD de Jong
Micah T. McClain
MJ Clemens
MJ Zilliox
N Dobigeon
N. Christine Øien
Nicholas J. Schork
Nicolas Dobigeon
NJ Cox
O Hössjer
P Bühlmann
P Bühlmann
P Palese
Peter J. Woolf
Q Zhu
RA Floyd
RB Turner
S Akira
S Kofler
SD Shapira
Stephen Kingsmore
T Ichinohe
T Kawai
T Kawai
T Kohonen
T Oda
TD Kanneganti
Timothy Veldman
Y Benjamini
Y Seki
Yongsheng Huang
Publication venue: Public Library of Science
Publication date: 01/08/2011
Field of study

Exposure to influenza viruses is necessary, but not sufficient, for healthy human hosts to develop symptomatic illness. The host response is an important determinant of disease progression. In order to delineate host molecular responses that differentiate symptomatic and asymptomatic Influenza A infection, we inoculated 17 healthy adults with live influenza (H3N2/Wisconsin) and examined changes in host peripheral blood gene expression at 16 timepoints over 132 hours. Here we present distinct transcriptional dynamics of host responses unique to asymptomatic and symptomatic infections. We show that symptomatic hosts invoke, simultaneously, multiple pattern recognition receptors-mediated antiviral and inflammatory responses that may relate to virus-induced oxidative stress. In contrast, asymptomatic subjects tightly regulate these responses and exhibit elevated expression of genes that function in antioxidant responses and cell-mediated responses. We reveal an ab initio molecular signature that strongly correlates to symptomatic clinical disease and biomarkers whose expression patterns best discriminate early from late phases of infection. Our results establish a temporal pattern of host molecular responses that differentiates symptomatic from asymptomatic infections and reveals an asymptomatic host-unique non-passive response signature, suggesting novel putative molecular targets for both prognostic assessment and ameliorative therapeutic intervention in seasonal and pandemic influenza

Public Library of Science (PLOS)

Crossref

Scientific Publications of the University of Toulouse II Le Mirail

Open Archive Toulouse Archive Ouverte

Directory of Open Access Journals

PubMed Central

DukeSpace

From basic to reduced bias kernel density estimators: links via Taylor series approximations

Author: Hössjer O.
Jones M. C.
Publication venue: 'Informa UK Limited'
Publication date: 01/01/1996
Field of study

The transformation kernel density estimator of Ruppert and Cline (1994) achieves bias of order h4 (as the bandwidth h→0), an improvement over the order h2 bias associated with the basic kernel density estimator. Hössjer and Ruppert (1994) use Taylor series expansions to build a bridge between the two, displaying an infinite sequence of O(h4) bias estimators in the process. In this paper, we extend the work of Hössjer and Ruppert (i) by investigating three other natural Taylor series expansions, and (ii) by applying the approach to two other O(h4) bias estimators, namely the variable bandwidth and multiplicative bias correction methods. Several further infinite sequences of O(h4) bias estimators result

Crossref

Open Research Online (The Open University)

Generalizing univariate signed rank statistics for testing and estimating a multivariate location parameter

Author: Croux Christophe
Hössjer O
Publication venue: 'Informa UK Limited'
Publication date: 01/01/1995
Field of study

Lirias

On the effect of estimating the error density in nonparametric deconvolution

Author: Hössjer O.
Neumann Michael H.
Publication venue: 'Informa UK Limited'
Publication date: 01/01/1997
Field of study

It is quite common in the statistical literature on nonparametric deconvolution to assume that the error density is perfectly known. Since this seems to be unrealistic in many practical applications, we study the effect of estimating the unknown error density. We derive minimax rates of convergence and propose a modification of the usual kernel-based estimation scheme, which takes the uncertainty about the error density into account. A simulation study quantifies the possible gains by this new method in finite sample situations

Crossref

Publications Server of the Weierstrass Institute for Applied Analysis and Stochastics