Interpreting 16S metagenomic data without clustering to achieve sub-OTU
  resolution

A Klindworth; A Shade; A Shade; AM Eren; BJ Haas; C Huttenhower; C Lozupone; C Quince; C Quince; DE Hunt; DN Fredricks; EK Costello; EK Costello; H Ochman; JG Caporaso; JG Caporaso; JI Prosser; JJ Faith; JL VandeWalle; JR Brestoff; M Hamady; MGI Langille; Mikhail Tikhonov; MJ Morgan; MJ Rosen; N Fierer; N Kamada; ND Youngblut; Ned S Wingreen; O Lukjancenko; PD Schloss; PD Schloss; PD Schloss; PJ Turnbaugh; RC Edgar; RC Edgar; RC Edgar; Robert W Leach; SJ Song; SM Huse; SP Preheim; TP Tourova; V Kunin; WJ Sul; Y Huang; ZJ Zheng

research

Interpreting 16S metagenomic data without clustering to achieve sub-OTU resolution

Authors: A Klindworth
A Shade
A Shade
AM Eren
BJ Haas
C Huttenhower
C Lozupone
C Quince
C Quince
DE Hunt
DN Fredricks
EK Costello
EK Costello
H Ochman
JG Caporaso
JG Caporaso
JI Prosser
JJ Faith
JL VandeWalle
JR Brestoff
M Hamady
MGI Langille
Mikhail Tikhonov
MJ Morgan
MJ Rosen
N Fierer
N Kamada
ND Youngblut
Ned S Wingreen
O Lukjancenko
PD Schloss
PD Schloss
PD Schloss
PJ Turnbaugh
RC Edgar
RC Edgar
RC Edgar
Robert W Leach
SJ Song
SM Huse
SP Preheim
TP Tourova
V Kunin
WJ Sul
Y Huang
ZJ Zheng
Publication date: 11 July 2014
Publisher: 'Springer Science and Business Media LLC'
Doi

Abstract

The standard approach to analyzing 16S tag sequence data, which relies on clustering reads by sequence similarity into Operational Taxonomic Units (OTUs), underexploits the accuracy of modern sequencing technology. We present a clustering-free approach to multi-sample Illumina datasets that can identify independent bacterial subpopulations regardless of the similarity of their 16S tag sequences. Using published data from a longitudinal time-series study of human tongue microbiota, we are able to resolve within standard 97% similarity OTUs up to 20 distinct subpopulations, all ecologically distinct but with 16S tags differing by as little as 1 nucleotide (99.2% similarity). A comparative analysis of oral communities of two cohabiting individuals reveals that most such subpopulations are shared between the two communities at 100% sequence identity, and that dynamical similarity between subpopulations in one host is strongly predictive of dynamical similarity between the same subpopulations in the other host. Our method can also be applied to samples collected in cross-sectional studies and can be used with the 454 sequencing platform. We discuss how the sub-OTU resolution of our approach can provide new insight into factors shaping community assembly.Comment: Updated to match the published version. 12 pages, 5 figures + supplement. Significantly revised for clarity, references added, results not change

Similar works

Full text

Available Versions

Crossref

info:doi/10.1038%2Fismej.2014....

Last time updated on 03/12/2019

Sustaining member

Princeton University Open Access Repository

oai:oar.princeton.edu:88435/pr...

Last time updated on 14/02/2024