Search CORE

18 research outputs found

Exact Cross-Validation for k-NN in binary classification, applications to passive and active learning

Author: Celisse A.
Mary-Huard Tristan
Publication venue: Société Française de Statistique et Société Mathématique de France
Publication date: 01/01/2011
Field of study

International audienc

HAL Descartes

Numérisation de Documents Anciens Mathématiques

Hal-Diderot

Détection de ruptures à partir de méthodes à noyaux

Author: Celisse A.
Marot G.
Pierre-Jean M.
Rigaill G.
Publication venue: HAL CCSD
Publication date: 24/06/2013
Field of study

HAL Evry

INRIA a CCSD electronic archive server

HAL Descartes

Hal-Diderot

Recommended from our members

Treatment patterns and outcomes for pancreatic tumors in children: an analysis of the National Cancer Database

Author: Ferrantella Anthony
Perez Eduardo A
Picado Omar
Rao Krishnamurti
Sola Juan E
Thorson Chad M
Zabalo Celisse
Publication venue: Germany
Publication date: 01/03/2020
Field of study

Pancreatic tumors are rare in children and limited data are available regarding incidence, treatment, and outcomes. We aim to describe patient and tumor characteristics and to report on survival of these diseases. Children with pancreatic tumors were queried from the National Cancer Database (2004-2014). The association between treatment and hazard of death was assessed using Kaplan-Meier method and Cox regression model. We identified 109 children with pancreatic tumors; 52% were male and median age at diagnosis was 14 years. Tumors were distributed as follows: pseudopapillary neoplasm (30%), endocrine tumors (27%), pancreatoblastoma (16%), pancreatic adenocarcinoma (16%), sarcoma (6%) and neuroblastoma (5%). Seventy-nine patients underwent surgery, of which 76% achieved R0 resection. Most patients (85%) had lymph nodes examined, of which 22% had positive nodes. Five-year overall survival by tumor histology was 95% (pseudopapillary neoplasm), 75% (neuroblastoma), 70% (pancreatoblastoma), 51% (endocrine tumors), 43% (sarcoma), and 34% (adenocarcinoma). On multivariable analysis, surgical resection was the strongest predictor of survival (HR 0.26, 95% CI 0.10-0.68, p < 0.01). Overall survival of children with pancreatic tumors is grim, with varying survival rates among different tumors. Surgical resection is associated with improved long-term survival

University of Miami: Scholarship Miami

Adjacency-constrained hierarchical clustering of a band similarity matrix with application to Genomics

Author: A Celisse
A Dehman
Alia Dehman
C Dalmasso
Christophe Ambroise
D Eppstein
D Garreau
E Grimm
ENCODE Project Consortium
F Guinot
F Picard
FB Baker
GJ Székely
Guillem Rigaill
J Dixon
J Fraser
J Qin
JA Reuter
JWJ Williams
KD Bennett
L Hubert
L Kaufman
M Chavent
M Forcato
N Aronszajn
N Haddad
Nathalie Vialaneix
Pierre Neuvial
SB Gabriel
T Strauss
TD Hocking
V Michel
Y Shen
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2019
Field of study

International audienceMotivation: Genomic data analyses such as Genome-Wide Association Studies (GWAS) or Hi-C studies are often faced with the problem of partitioning chromosomes into successive regions based on a similarity matrix of high-resolution, locus-level measurements. An intuitive way of doing this is to perform a modified Hierarchical Agglomerative Clustering (HAC), where only adjacent clusters (according to the ordering of positions within a chromosome) are allowed to be merged. A major practical drawback of this method is its quadratic time and space complexity in the number of loci, which is typically of the order of 10^4 to 10^5 for each chromosome. Results: By assuming that the similarity between physically distant objects is negligible, we propose an implementation of this adjacency-constrained HAC with quasi-linear complexity. Our illustrations on GWAS and Hi-C datasets demonstrate the relevance of this assumption, and show that this method highlights biologically meaningful signals. Thanks to its small time and memory footprint, the method can be run on a standard laptop in minutes or even seconds. Availability and Implementation: Software and sample data are available as an R package, adjclust, that can be downloaded from the Comprehensive R Archive Network (CRAN)

arXiv.org e-Print Archive

HAL Evry

Crossref

Scientific Publications of the University of Toulouse II Le Mirail