Benchmark of structured machine learning methods for microbial identification from mass-spectrometry data

Author: Mahé Pierre
Vert Jean-Philippe
Vervier Kévin
Veyrieras Jean-Baptiste
Publication date: 13/05/2015

Microbial identification is a central issue in microbiology, in particular in the fields of infectious disease diagnosis and industrial quality control. The concept of species is tightly linked to the concept of biological and clinical classification, where the proximity between species is generally measured in terms of evolutionary distances and/or clinical phenotypes. Surprisingly, the information provided by this well-known hierarchical structure is rarely used by machine learning-based automatic microbial identification systems. Structured machine learning methods were recently proposed to take into account the structure embedded in a hierarchy and use it as additional a priori information, and could therefore help improve microbial identification systems. We test and compare several state-of-the-art machine learning methods for microbial identification on a new Matrix-Assisted Laser Desorption/Ionization Time-of-Flight mass spectrometry (MALDI-TOF MS) dataset. The benchmark includes both standard methods and structured methods that leverage knowledge of the underlying hierarchical structure in the learning process. Our results show that although some methods perform better than others, structured methods do not consistently perform better than their "flat" counterparts. We postulate that this is partly because standard methods already reach a high level of accuracy in this context, and because they mainly confuse species close to each other in the tree, a case where using the known hierarchy is not helpful.

arXiv.org e-Print Archive

HAL-MINES ParisTech

Updating the Northern Tsetse Limit in Burkina Faso (1949–2009): Impact of Global Change

Author: Barclay
Ben Yahmed
Bouyer
Bouyer
Budd
Cecchi
Cecchi
Challier
Challier
Courtin
Courtin
De la Rocque
Desquesnes
D’Orgeval
Fabrice Courtin
Ford
Gado
Githeko
Gouzien
Gruvel
Guengant
Guerrini
Issa Sidibé
Issa Tamboura
Jean-Baptiste Rayaissé
Kabayo
Koudougou
Kubi
Laveissière
Laveissière
Mahé
Mandé
Nash
Oumar Serdébéogo
Paturel
Philippe Solano
Rayaisse
Reid
Rogers
Rogers
Rouamba
Roubaud
Solano
Solano
Zowindé Koudougou
Publication venue: Molecular Diversity Preservation International (MDPI)
Publication date: 01/01/2010

The northern distribution limit of tsetse flies in Burkina Faso was updated and compared to previous limits in order to revise the existing map of these vectors of African trypanosomiases, which dates from several decades ago. From 1949 to 2009, the limit shifted 25 to 150 km toward the south. Tsetse are now discontinuously distributed in Burkina Faso, with a western and an eastern tsetse belt. This range shift can be explained by a combination of decreased rainfall and increased human density. Within a context of international control, this study provides a better understanding of the factors influencing the distribution of tsetse flies.

Crossref

Directory of Open Access Journals

PubMed Central

Horizon / Pleins textes

A geografia médica e as expedições francesas para o Brasil: uma descrição da estação naval do Brasil e da Prata (1868-1870)

Crossref

Sélection de variables structurée par régularisation jointe dans un cadre multi-tâches

Author: Bonnefoy Antoine
Mahé Pierre
Ouamlil Ismael
Veyrieras Jean-Baptiste
Publication venue: HAL CCSD
Publication date: 05/07/2016

Motivated by diagnostic applications in the field of clinical microbiology, we introduce a joint input/output regularization method to perform structured variable selection in a multi-task setting where tasks can exhibit various degrees of correlation. Our approach relies extensively on the tree-structured group-lasso penalty and explicitly combines hierarchical structures defined across features and tasks by means of the Cartesian product of graphs to induce a global hierarchical group structure. A vectorization procedure is then used to solve the resulting multi-task problem with standard single-task optimization algorithms developed for the overlapping group-lasso problem. Experimental results on simulated and real data demonstrate the value of the approach.

HAL AMU

On learning matrices with orthogonal columns or disjoint supports

Author: d'Aspremont Alexandre
Mahé Pierre
Vert Jean-Philippe
Vervier Kevin
Veyrieras Jean-Baptiste
Publication venue: HAL CCSD
Publication date: 13/04/2014

We investigate new matrix penalties to jointly learn linear models with orthogonality constraints, generalizing the work of Xiao et al. [24] who proposed a strictly convex matrix norm for orthogonal transfer. We show that this norm converges to a particular atomic norm when its convexity parameter decreases, leading to new algorithmic solutions to minimize it. We also investigate concave formulations of this norm, corresponding to more aggressive strategies to induce orthogonality, and show how these penalties can also be used to learn sparse models with disjoint supports.

HAL Descartes

Large-scale Machine Learning for Metagenomics Sequence Classification

Author: Mahé Pierre
Tournoud Maud
Vert Jean-Philippe
Vervier Kévin
Veyrieras Jean-Baptiste
Publication venue: HAL CCSD
Publication date: 12/05/2015

Metagenomics characterizes the taxonomic diversity of microbial communities by sequencing DNA directly from an environmental sample. One of the main challenges in metagenomics data analysis is the binning step, where each sequenced read is assigned to a taxonomic clade. Due to the large volume of metagenomics datasets, binning methods need fast and accurate algorithms that can operate with reasonable computing requirements. While standard alignment-based methods provide state-of-the-art performance, compositional approaches that assign a taxonomic class to a DNA read based on the k-mers it contains have the potential to provide faster solutions. In this work, we investigate the potential of modern, large-scale machine learning implementations for taxonomic assignment of next-generation sequencing reads based on their k-mer profile. We show that machine learning-based compositional approaches benefit from increasing the number of fragments sampled from reference genomes to tune their parameters, up to a coverage of about 10, and from increasing the k-mer size to about 12. Tuning these models involves training a machine learning model on about 10^8 samples in 10^7 dimensions, which is out of reach of standard software but can be done efficiently with modern implementations for large-scale machine learning. The resulting models are competitive in terms of accuracy with well-established alignment tools for problems involving a small to moderate number of candidate species, and for reasonable amounts of sequencing errors. We show, however, that compositional approaches are still limited in their ability to deal with problems involving a greater number of species, and are more sensitive to sequencing errors. We finally confirm that compositional approaches achieve faster prediction times, with a gain of 3 to 15 times with respect to the BWA-MEM short read mapper, depending on the number of candidate species and the level of sequencing noise.

arXiv.org e-Print Archive

PubMed Central

HAL-MINES ParisTech

Large-scale machine learning for metagenomics sequence classification

Author: Agarwal
Bottou
Jean-Baptiste Veyrieras
Jean-Philippe Vert
Kévin Vervier
Lindner
Maud Tournoud
Pierre Mahé
Pruitt
Sonnenburg
Publication venue: Oxford University Press (OUP)

Crossref

Therapy-related Myeloid Neoplasms in Patients With Chronic Lymphocytic Leukemia Who Received FCR/FC as Frontline Therapy

Author: Alix Baugier de Materre
Béatrice Mahé
Caroline Dartigeas
Charles Herbaux
Clotilde Bravetti
Cécile Tomowiak
Damien Roos-Weil
David Ghez
Fatiha Merabet
Florence Nguyen-Khac
Frédéric Davi
Jean-Baptiste Micol
Kamel Laribi
Karim Maloum
Lise Willems
Loïc Ysebaert
Marie C. Béné
Maud Voldoire
Ronan Le Calloch
Stéphane Leprêtre
Yamina Touileb
Publication venue: Wiley
Publication date: 11/05/2022

Directory of Open Access Journals

PubMed Central

Varia

OpenEdition

Bulletin Bibliographique

Author: Allouche-Benayoun Joëlle
Altglas Véronique
Alves Daniel
Andrade Luis Martínez
Andézian Sossie
Arppe Tiina
Aubin Françoise
Aubin-Boltanski Emma
Baubérot Jean
Baylocq Sassoubre Cédric
Benveniste Annie
Blanchy Sophie
Bonhomme Julien
Béraud Céline
Bœspflug François
Casajus Dominique
Castella Lili
Champion Françoise
Chevalier Yves
Clémentin-Ojha Catherine
Coulmont Baptiste
Courcy Raymond
Davie Grace
Dehouve Danièle
Duchesne Véronique
Dumoncel Jean-Claude
Duteil-Ogata Fabienne
Fer Yannick
Frijhoff Willem
Fähndrich Hartmut
Girard Aurélien
Gruson Pascale
Gugelot Frédéric
Hurel Daniel-Odon
Jonveaux Isabelle
Kemnitz Eva-Maria von
Koussens David
Labéy-Guimard Guénolé
Langewiesche Katrin
Lassave Pierre
Laurant Jean-Pierre
Lautman Françoise
Le Mer Régis
Le Mer Regis
Löwy Michael
L’équipe de rédaction
Maes Bruno
Mahé Alain
Mary André
Massignon Bérengère
Michon Bruno
Mirzayantz Evan
Mossière Géraldine
Naïmi Mustapha
Nemo-Pekelman Capucine
Obadia Lionel
Ormières Jean-Louis
Ostenc Michel
Pace Enzo
Padoux André
Roux Rodolfo de
Roy Jean-Marc
Seraïdari Katerina
Sleiman André
Sleiman André Georges
Somé Magloire
Suremain Charles-Édouard de
Sère Bénédicte
Teyssier Ronan
Thielmann Jörn
Timotin Andrei
Turcotte Paul-André
Van den Kerchove Anna
Vermander Benoît
Vidal Daniel
Voyé Liliane
Youssef Youhanna
Zapponi Elena
Zomo Maixant Mebiame
Publication venue: OpenEdition
Publication date: 01/01/2013

To all readers and contributors of the Bulletin Bibliographique of the ASSR: for the first time this year, we are publishing the reviews online on our Revues.org site on a semiannual schedule (June / December), which matches the deadlines of the Bulletin Bibliographique. All reviews for the year 2009 will be published in our issue 148 (October-December).

OpenEdition