56 research outputs found

A Fast Algorithm for Robust Regression with Penalised Trimmed Squares

Author: A Giloni
AC Atkinson
AS Hadi
C Agostinelli
CW Coakley
D Gervini
D Peña
DM Hawkins
DM Sebert
G Zioutas
G. Zioutas
J Agulló
JF Gentleman
L. Pitsoulis
LM Li
LS Pitsoulis
M Salibian-Barrera
MS Bazaraa
N Billor
O Hössjer
PJ Rousseeuw
RJ Rousseeuw
TA Feo
VJ Yohai
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2009

The presence of groups containing high leverage outliers makes linear regression a difficult problem due to the masking effect. The available high breakdown estimators based on Least Trimmed Squares (LTS) often do not succeed in detecting masked high leverage outliers in finite samples. An alternative to the LTS estimator, called the Penalised Trimmed Squares (PTS) estimator, was introduced by the authors in \cite{ZiouAv:05,ZiAvPi:07} and appears to be less sensitive to the masking problem. This estimator is defined by a Quadratic Mixed Integer Programming (QMIP) problem whose objective function includes a penalty cost for each observation, which serves as an upper bound on the residual error for any feasible regression line. Since the PTS does not require presetting the number of outliers to delete from the data set, it has better efficiency than other estimators. However, due to the high computational complexity of the resulting QMIP problem, exact solutions for moderately large regression problems are infeasible. In this paper we further establish the theoretical properties of the PTS estimator, such as high breakdown and efficiency, and propose an approximate algorithm called Fast-PTS to compute the PTS estimator for large data sets efficiently. Extensive computational experiments on sets of benchmark instances with varying degrees of outlier contamination indicate that the proposed algorithm performs well in identifying groups of high leverage outliers in reasonable computational time. Comment: 27 pages
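The trade-off described in this abstract can be illustrated with a small sketch. It is not the authors' Fast-PTS algorithm and not their exact QMIP formulation, which the abstract does not spell out. It assumes a simplified cost (the sum of squared residuals of the retained points plus a fixed penalty for every deleted point), a single predictor, and a uniform penalty value. The names `ols_line`, `pts_brute_force`, and `penalty` are hypothetical. The search enumerates every deletion set, which is only workable for a handful of points and hints at why exact solutions become infeasible as the data set grows.

```python
from itertools import combinations


def ols_line(points):
    """Ordinary least-squares intercept and slope for a list of (x, y) pairs."""
    n = len(points)
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in points)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in points)
    slope = sxy / sxx if sxx > 0 else 0.0
    return mean_y - slope * mean_x, slope


def pts_brute_force(points, penalty):
    """Try every deletion set and keep the cheapest one.

    Assumed cost: squared residuals of the kept points plus `penalty`
    for each deleted point. The number of deletions is not fixed in
    advance; it falls out of the minimisation.
    """
    n = len(points)
    best = None
    for k in range(0, n - 1):  # always keep at least two points
        for deleted in combinations(range(n), k):
            kept = [p for i, p in enumerate(points) if i not in deleted]
            a, b = ols_line(kept)
            rss = sum((y - (a + b * x)) ** 2 for x, y in kept)
            cost = rss + penalty * k
            if best is None or cost < best[0]:
                best = (cost, set(deleted), (a, b))
    return best


# Six points on y = 2x + 1 plus one high leverage outlier at x = 20.
data = [(x, 2 * x + 1) for x in range(6)] + [(20, 0)]
cost, deleted, (intercept, slope) = pts_brute_force(data, penalty=1.0)
print(deleted, intercept, slope)
```

In this toy data set the search deletes only the point at index 6 and pays one penalty, because keeping it would cost far more in squared residuals than the penalty does.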

arXiv.org e-Print Archive

CiteSeerX

Crossref

Catering to the 'fringe:' a new approach to product development and market segmentation

Author: Giloni A.
Seshadri S.
Tucci Christopher L.
Publication date: 07/12/2005

Infoscience - École polytechnique fédérale de Lausanne

Neo-Rawlsian fringes: A new approach to market segmentation and product development

Author: Giloni A.
Seshadri S.
Tucci Christopher L.
Publication date: 13/12/2005

A new approach to market segmentation and product development

Infoscience - École polytechnique fédérale de Lausanne

The Leeuwenhoek Lecture, 1982 - Studies of microbial products in rising to the challenge of curing cancer

Author: Aoyagi T.
Brockmann H.
Burger R. M.
Giloni L.
Kunimoto T.
Noel J. P.
Noma T.
Umezawa H.
Publication venue: 'The Royal Society'

Crossref

Filtering Outliers in One Step with Genetic Programming

Author: A Alfons
A Giloni
FA Fortin
I Gonçalves
L Trujillo
M Hubert
M Kotanchek
MA Fischler
ME Kotanchek
N Meinshausen
PH Torr
PJ Rousseeuw
R Pearson
RI Hartley
U López
V Chandola
VJ Hodge
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 08/09/2018

Outliers are one of the most difficult issues when dealing with real-world modeling tasks. Even a small percentage of outliers can impede a learning algorithm’s ability to fit a dataset. While robust regression algorithms exist, they fail when outliers make up more than 50% of a dataset (the breakdown point). In the case of Genetic Programming (GP), robust regression has not been properly studied. In this paper we present a method that works as a filter, removing outliers from the target variable (vertical outliers). The algorithm is simple: it uses a randomly generated population of GP trees to determine which target values should be labeled as outliers. The method is highly efficient. Results show that it can return a clean dataset when contamination reaches as high as 90%, and may be able to handle higher levels of contamination. In this study only synthetic univariate benchmarks are used to evaluate the approach, but it must be stressed that no other approaches can deal with such high levels of outlier contamination while requiring such small computational effort.

Crossref

INRIA a CCSD electronic archive server

Oskar Bordeaux

Dual Mechanisms of DNA Damage by MoCH3(η3-allyl)(CO)2(phen) Complexes

Author: Burrows C. J.
Connolly T. J.
Dizdaroglu M.
Frank B. L.
Friedberg E. C.
Giloni L.
Goldstein S.
Hecht S.
Huston P.
Meijier M. M.
Mohler D. L.
Pogozelski W. K.
Povirk L. F.
Pérez J.
Sitlani A.
Tenhaeff S. C.
Yamada M.
Publication venue: 'American Chemical Society (ACS)'

Crossref

Luminescence from the bleomycin + iron(II) + O2

Author: Bonadonna G.
Burger R. M.
Giloni L.
Hüttenhofer A.
I. Kruk
J. Kladny
K. Lichszteld
Kanofsky J. R.
Kikuchi H.
Kuramochi H.
Murugesan N.
Nakamura M.
Sugiura Y.
T. Michalska
Takita T.
Publication venue: 'Informa UK Limited'

Crossref

Oxygen Radical Formation During Redox Cycling of Bleomycin-Fe(III) Catalyzed by NADPH-Cytochrome P-450 Reductase of Liver Microsomes and Nuclei

Author: AP Grollman
BK Sinha
ChK Mirabelli
DR Bickers
EA Sausville
H Kappus
H Umezawa
I Yamazaki
JG Filser
JMC Gutteridge
KA Kennedy
L Giloni
M Romano
M Swanson
MA Trush
ME Scheulen
N. Yamanaka
NR Bachur
RE Kilkuskie
RM Burger
S Fleischer
WJ Arion
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/1986

Crossref