Search CORE

19 research outputs found

Additional file 1: of MICRA: an automatic pipeline for fast characterization of microbial genomes from high-throughput sequencing data

Author: Alexandre Loywick (3605780)
Christophe Audebert (3605774)
David Hot (120264)
GaĂŤl Even (4705801)
SĂŠgolĂ¨ne Caboche (4705798)
Publication venue
Publication date: 01/12/2017
Field of study

Supplementary figures, notes, and tables. (PDF 3297 kb

Directory of Open Access Journals

FigShare

Comparison of F-measures between the 200(V3) and 400(V4-V5) amplicon at the family level (left) and at the genus level (right) on the HC 50k dataset with error simulation.

Author: Christophe Audebert (3605774)
David Hot (120264)
Hélène Touzet (65840)
Léa Siegwald (3615494)
Ségolène Caboche (3615491)
Yves Lemoine (2288080)
Publication venue
Publication date
Field of study

Comparison of F-measures between the 200(V3) and 400(V4-V5) amplicon at the family level (left) and at the genus level (right) on the HC 50k dataset with error simulation.</p

FigShare

Schematic overview of the evaluation protocol.

Author: Christophe Audebert (3605774)
David Hot (120264)
Hélène Touzet (65840)
Léa Siegwald (3615494)
Ségolène Caboche (3615491)
Yves Lemoine (2288080)
Publication venue
Publication date
Field of study

Schematic overview of the evaluation protocol.</p

FigShare

Distinctions between clustering-first and assignment-first approaches.

Author: Christophe Audebert (3605774)
David Hot (120264)
Hélène Touzet (65840)
Léa Siegwald (3615494)
Ségolène Caboche (3615491)
Yves Lemoine (2288080)
Publication venue
Publication date
Field of study

A question mark indicates an unclassified read and/or taxon.</p

FigShare

Chao1 values before taxonomic merging for clustering-first pipelines, and at the family level after taxonomic merging for all pipelines, at three different complexities, on the 50k 200(V3) with error simulation datasets.

Author: Christophe Audebert (3605774)
David Hot (120264)
Hélène Touzet (65840)
Léa Siegwald (3615494)
Ségolène Caboche (3615491)
Yves Lemoine (2288080)
Publication venue
Publication date
Field of study

LC, MC and HC were all composed of 50 bacterial families, in varying proportions.</p

FigShare

Comparison of the richness (Chao1) and diversity (Inverse Simpson) indexes for clustering-first pipelines before taxonomic merging, on the 200(V3) HC dataset with sequencing errors simulation when generating 25k, 50k and 100k sequences.

Author: Christophe Audebert (3605774)
David Hot (120264)
Hélène Touzet (65840)
Léa Siegwald (3615494)
Ségolène Caboche (3615491)
Yves Lemoine (2288080)
Publication venue
Publication date
Field of study

Comparison of the richness (Chao1) and diversity (Inverse Simpson) indexes for clustering-first pipelines before taxonomic merging, on the 200(V3) HC dataset with sequencing errors simulation when generating 25k, 50k and 100k sequences.</p

FigShare

Proportions of the top 10 families per pipeline on the LC, MC and HC 50k 200(V3) with error simulation datasets, and their matching 1-NID clustering indexes (computed after taxonomic merging) at the genus and family levels.

Author: Christophe Audebert (3605774)
David Hot (120264)
Hélène Touzet (65840)
Léa Siegwald (3615494)
Ségolène Caboche (3615491)
Yves Lemoine (2288080)
Publication venue
Publication date
Field of study

Proportions of the top 10 families per pipeline on the LC, MC and HC 50k 200(V3) with error simulation datasets, and their matching 1-NID clustering indexes (computed after taxonomic merging) at the genus and family levels.</p

FigShare

F-measure and richness index error percentage after taxonomic merging for each pipeline on the 200(V3) 50k HC dataset with error simulation, when using different databases (the recommended database for each pipeline is marked with *).

Author: Christophe Audebert (3605774)
David Hot (120264)
Hélène Touzet (65840)
Léa Siegwald (3615494)
Ségolène Caboche (3615491)
Yves Lemoine (2288080)
Publication venue
Publication date
Field of study

F-measure and richness index error percentage after taxonomic merging for each pipeline on the 200(V3) 50k HC dataset with error simulation, when using different databases (the recommended database for each pipeline is marked with *).</p

FigShare

Comparison of F-measures (top) and richness error (bottom) in the error-free and error-prone sequencing models on the 200(V3) HC 50k dataset.

Author: Christophe Audebert (3605774)
David Hot (120264)
Hélène Touzet (65840)
Léa Siegwald (3615494)
Ségolène Caboche (3615491)
Yves Lemoine (2288080)
Publication venue
Publication date
Field of study

Comparison of F-measures (top) and richness error (bottom) in the error-free and error-prone sequencing models on the 200(V3) HC 50k dataset.</p

FigShare

Proportions of the top 10 families per pipeline on a real dataset, and their matching Chao1 diversity indexes (computed after taxonomic merging) at the family level.

Author: Christophe Audebert (3605774)
David Hot (120264)
Hélène Touzet (65840)
Léa Siegwald (3615494)
Ségolène Caboche (3615491)
Yves Lemoine (2288080)
Publication venue
Publication date
Field of study

Below, average linkage hierarchical clustering of all pipelines based on a Euclidean distance calculation on the amount on all reads per family per pipeline (excluding unclassified reads). Pipelines are marked with a * when executed with their default database.</p

FigShare