Search CORE

195 research outputs found

Assessing the Diversity and Specificity of Two Freshwater Viral Communities through Metagenomics

Transitions between saline and fresh waters have been shown to be infrequent for microorganisms. Based on host-specific interactions, the presence of specific clades among hosts suggests the existence of freshwater-specific viral clades. Yet, little is known about the composition and diversity of the temperate freshwater viral communities, and even if freshwater lakes and marine waters harbor distinct clades for particular viral sub-families, this distinction remains to be demonstrated on a community scale

Public Library of Science (PLOS)

Crossref

Hal - Université Grenoble Alpes

HAL Clermont Université

Directory of Open Access Journals

PubMed Central

Agritrop

HAL Université de Savoie

FigShare

The P-SSP7 Cyanophage Has a Linear Genome with Direct Terminal Repeats

Author: A Millard
D Lindell
D Lindell
D Lindell
Debbie Lindell
F Partensky
FE Angly
G Sabehi
G Zeidner
Gazalah Sabehi
Gil Ast
IJ Molineux
J Sambrook
LR Moore
LW Black
MB Sullivan
MB Sullivan
MB Sullivan
MR Henn
NH Mann
SR Bench
WH Pope
X Liu
Publication venue: Public Library of Science
Publication date: 01/01/2012
Field of study

P-SSP7 is a T7-like phage that infects the cyanobacterium Prochlorococcus MED4. MED4 is a member of the high-light-adapted Prochlorococcus ecotypes that are abundant in the surface oceans and contribute significantly to primary production. P-SSP7 has become a model system for the investigation of T7-like phages that infect Prochlorococcus. It was classified as T7-like based on genome content and organization. However, because its genome assembled as a circular molecule, it was thought to be circularly permuted and to lack the direct terminal repeats found in other T7-like phages. Here we sequenced the ends of the P-SSP7 genome and found that the genome map is linear and contains a 206 bp repeat at both genome ends. Furthermore, we found that a 728 bp region of the genome originally placed downstream of the last ORF is actually located upstream of the first ORF on the genome map. These findings suggest that P-SSP7 is likely to use the direct terminal repeats for genome replication and packaging in a similar manner to other T7-like phages. Moreover, these results highlight the importance of experimentally verifying the ends of phage genomes, and will facilitate the use of P-SSP7 as a model for the correct assembly and end determination of the many T7-like phages isolated from the marine environment that are currently being sequenced

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Deep sequencing evidence from single grapevine plants reveals a virome dominated by mycoviruses

Author: A Djikeng
A López-Bueno
A. Rowhani
B Coetzee
CM Fauquet
F. Cordero
FE Angly
G Routh
GP Martelli
HN Pearson
J Atif
J. R. Úrbez-Torres
JR Úrbez-Torres
L Sage
LJ Crawford
M Al Rwahnih
M. Al Rwahnih
RA Edwards
S Nakamura
S. Daubert
SF Altschul
TJ White
YP Zhang
Publication venue: Springer Vienna
Publication date: 01/01/2010
Field of study

We have characterized the virome in single grapevines by 454 high-throughput sequencing of double-stranded RNA recovered from the vine stem. The analysis revealed a substantial set of sequences similar to those of fungal viruses. Twenty-six putative fungal virus groups were identified from a single plant source. These represented half of all known mycoviral families including the Chrysoviridae, Hypoviridae, Narnaviridae, Partitiviridae, and Totiviridae. Three of the mycoviruses were associated with Botrytis cinerea, a common fungal pathogen of grapes. Most of the rest appeared to be undescribed. The presence of viral sequences identified by BLAST analysis was confirmed by sequencing PCR products generated from the starting material using primers designed from the genomic sequences of putative mycoviruses. To further characterize these sequences as fungal viruses, fungi from the grapevine tissue were cultured and screened with the same PCR probes. Five of the mycoviruses identified in the total grapevine extract were identified again in extracts of the fungal cultures

Crossref

Springer - Publisher Connector

PubMed Central

Metagenomic Analysis of Lysogeny in Tampa Bay: Implications for Prophage Gene Expression

Author: Amy Long
C Desnues
C Leitet
CA Suttle
CA Suttle
D Stopar
DB Rusch
DJ Carpenter
DL Kirchman
EA Dinsdale
EF DeLong
F Angly
F Chen
F Rohwer
FE Angly
Forest Rohwer
Geraldine Butler
HW Ackermann
HW Ackermann
I Sharon
J Laybourn-Parry
J Sambrook
JA Fuhrman
Jennifer Mobberley
JH Paul
John H. Paul
KE Wommack
KE Wommack
L McDaniel
L McDaniel
Lauren McDaniel
M Margulies
Matthew Haynes
MB Sullivan
MG Weinbauer
Mya Breitbart
SC Jiang
SC Jiang
SJ Williamson
SJ Williamson
SR Bench
SW Wilhem
X Mou
Ø Bergh
Publication venue: Public Library of Science
Publication date: 01/01/2008
Field of study

Phage integrase genes often play a role in the establishment of lysogeny in temperate phage by catalyzing the integration of the phage into one of the host's replicons. To investigate temperate phage gene expression, an induced viral metagenome from Tampa Bay was sequenced by 454/Pyrosequencing. The sequencing yielded 294,068 reads with 6.6% identifiable. One hundred-three sequences had significant similarity to integrases by BLASTX analysis (e≤0.001). Four sequences with strongest amino-acid level similarity to integrases were selected and real-time PCR primers and probes were designed. Initial testing with microbial fraction DNA from Tampa Bay revealed 1.9×107, and 1300 gene copies of Vibrio-like integrase and Oceanicola-like integrase L−1 respectively. The other two integrases were not detected. The integrase assay was then tested on microbial fraction RNA extracted from 200 ml of Tampa Bay water sampled biweekly over a 12 month time series. Vibrio-like integrase gene expression was detected in three samples, with estimated copy numbers of 2.4-1280 L−1. Clostridium-like integrase gene expression was detected in 6 samples, with estimated copy numbers of 37 to 265 L−1. In all cases, detection of integrase gene expression corresponded to the occurrence of lysogeny as detected by prophage induction. Investigation of the environmental distribution of the two expressed integrases in the Global Ocean Survey Database found the Vibrio-like integrase was present in genome equivalents of 3.14% of microbial libraries and all four viral metagenomes. There were two similar genes in the library from British Columbia and one similar gene was detected in both the Gulf of Mexico and Sargasso Sea libraries. In contrast, in the Arctic library eleven similar genes were observed. The Clostridium-like integrase was less prevalent, being found in 0.58% of the microbial and none of the viral libraries. These results underscore the value of metagenomic data in discovering signature genes that play important roles in the environment through their expression, as demonstrated by integrases in lysogeny

Public Library of Science (PLOS)

Crossref

USFSP Digital Archive

Directory of Open Access Journals

PubMed Central

Scholar Commons - University of South Florida

Assessment of Metagenomic Assembly Using Simulated Next Generation Sequencing Data

Author: AH Singh
Aino I. Järvelin
Alison S. Waller
B Ewing
B Ewing
CB Abulencia
D Chivian
D Wu
Daniel R. Mende
DC Richter
DR Zerbino
ED Harrington
ES Lander
EW Myers
F Meyer
FE Angly
FE Angly
GW Tyson
H García Martín
H-H Chou
J Goecks
J Goll
J Handelsman
J Muller
J Peterson
J Qin
J Raes
J Raes
JC Venter
Jeroen Raes
John Parkinson
JR Miller
JR Miller
K Kurokawa
K Mavromatis
M Arumugam
M Arumugam
M Pignatelli
M Pop
Manimozhiyan Arumugam
Michelle M. Chan
MP Cox
Peer Bork
PJ Turnbaugh
PJA Cock
R Li
R Li
R Schmieder
RA Edwards
RL Warren
S Aparicio
SG Tringe
Shinichi Sunagawa
SR Gill
T Schoenfeld
TA Gianoulis
TC Glenn
VM Markowitz
W Zhu
Publication venue: Public Library of Science
Publication date: 01/01/2012
Field of study

Due to the complexity of the protocols and a limited knowledge of the nature of microbial communities, simulating metagenomic sequences plays an important role in testing the performance of existing tools and data analysis methods with metagenomic data. We developed metagenomic read simulators with platform-specific (Sanger, pyrosequencing, Illumina) base-error models, and simulated metagenomes of differing community complexities. We first evaluated the effect of rigorous quality control on Illumina data. Although quality filtering removed a large proportion of the data, it greatly improved the accuracy and contig lengths of resulting assemblies. We then compared the quality-trimmed Illumina assemblies to those from Sanger and pyrosequencing. For the simple community (10 genomes) all sequencing technologies assembled a similar amount and accurately represented the expected functional composition. For the more complex community (100 genomes) Illumina produced the best assemblies and more correctly resembled the expected functional composition. For the most complex community (400 genomes) there was very little assembly of reads from any sequencing technology. However, due to the longer read length the Sanger reads still represented the overall functional composition reasonably well. We further examined the effect of scaffolding of contigs using paired-end Illumina reads. It dramatically increased contig lengths of the simple community and yielded minor improvements to the more complex communities. Although the increase in contig length was accompanied by increased chimericity, it resulted in more complete genes and a better characterization of the functional repertoire. The metagenomic simulators developed for this research are freely available

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Copenhagen University Research Information System

MDC Repository

FigShare

Phage Encoded H-NS: A Potential Achilles Heel in the Bacterial Defence System

The relationship between phage and their microbial hosts is difficult to elucidate in complex natural ecosystems. Engineered systems performing enhanced biological phosphorus removal (EBPR), offer stable, lower complexity communities for studying phage-host interactions. Here, metagenomic data from an EBPR reactor dominated by Candidatus Accumulibacter phosphatis (CAP), led to the recovery of three complete and six partial phage genomes. Heat-stable nucleoid structuring (H-NS) protein, a global transcriptional repressor in bacteria, was identified in one of the complete phage genomes (EPV1), and was most similar to a homolog in CAP. We infer that EPV1 is a CAP-specific phage and has the potential to repress up to 6% of host genes based on the presence of putative H-NS binding sites in the CAP genome. These genes include CRISPR associated proteins and a Type III restriction-modification system, which are key host defense mechanisms against phage infection. Further, EPV1 was the only member of the phage community found in an EBPR microbial metagenome collected seven months prior. We propose that EPV1 laterally acquired H-NS from CAP providing it with a means to reduce bacterial defenses, a selective advantage over other phage in the EBPR system. Phage encoded H-NS could constitute a previously unrecognized weapon in the phage-host arms race

CiteSeerX

Public Library of Science (PLOS)

Crossref

USFSP Digital Archive

Directory of Open Access Journals

PubMed Central

Queensland University of Technology ePrints Archive

Scholar Commons - University of South Florida

University of Queensland eSpace

Analysis and comparison of very large metagenomes with fast clustering and functional annotation

Author: AC McHardy
AR Quinlan
B Rodriguez-Brito
D Sheskin
DB Rusch
DC Richter
DH Huson
E Portugaly
EA Dinsdale
EF DeLong
FE Angly
GW Tyson
H Noguchi
H Noguchi
H Teeling
H Teeling
J Shendure
JC Venter
K Mavromatis
KJ Hoff
L Krause
PD Schloss
R Seshadri
RK Aziz
S Yooseph
S Yooseph
SF Altschul
SG Tringe
SR Eddy
SR Gill
W Li
W Li
W Li
W Li
Weizhong Li
Publication venue: BioMed Central
Publication date: 01/01/2009
Field of study

Abstract Background The remarkable advance of metagenomics presents significant new challenges in data analysis. Metagenomic datasets (metagenomes) are large collections of sequencing reads from anonymous species within particular environments. Computational analyses for very large metagenomes are extremely time-consuming, and there are often many novel sequences in these metagenomes that are not fully utilized. The number of available metagenomes is rapidly increasing, so fast and efficient metagenome comparison methods are in great demand. Results The new metagenomic data analysis method Rapid Analysis of Multiple Metagenomes with a Clustering and Annotation Pipeline (RAMMCAP) was developed using an ultra-fast sequence clustering algorithm, fast protein family annotation tools, and a novel statistical metagenome comparison method that employs a unique graphic interface. RAMMCAP processes extremely large datasets with only moderate computational effort. It identifies raw read clusters and protein clusters that may include novel gene families, and compares metagenomes using clusters or functional annotations calculated by RAMMCAP. In this study, RAMMCAP was applied to the two largest available metagenomic collections, the "Global Ocean Sampling" and the "Metagenomic Profiling of Nine Biomes". Conclusion RAMMCAP is a very fast method that can cluster and annotate one million metagenomic reads in only hundreds of CPU hours. It is available from <url>http://tools.camera.calit2.net/camera/rammcap/</url>.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Metagenomic Analysis of Respiratory Tract DNA Viral Communities in Cystic Fibrosis and Non-Cystic Fibrosis Individuals

Author: A Livraghi
AF Andersson
AJ Gentles
B Rodriguez-Brito
Bahador Nosrat
BE van Ewijk
BS Everitt
C Desnues
C Goerke
D Willner
Dana Willner
DF Rogers
DM Raskin
Douglas Conrad
EA Dinsdale
F Angly
F Harrison
F Klein
F Meyer
F Rohwer
F Wartha
FB Dean
FE Angly
Florent E. Angly
Forest Rohwer
GB Rogers
GB Rogers
GB Winnie
H Ochman
H See
J Azeredo
J Heyder
JA Fuhrman
Jeffrey A. Gold
JM Corne
Joas Silva
K Potrykus
KL Palmer
KL Palmer
L Zawadzka-Głos
LL Kulczycki
M Breitbart
M Breitbart
M Breitbart
Matthew Haynes
Mike Furlan
MJ Goldman
MR Knowles
P Green
P Lohavanichbutr
PJ Turnbaugh
PJ Turnbaugh
PM Beringer
R Overbeek
R Pinard
RK Aziz
Robert Schmieder
RV Miller
RV Thurber
S Nakamura
Sassan Tammadoni
SF Altschul
SG Tringe
SH Randell
SR Bencht
SR Gill
T Allander
T Schoenfeld
T Vadivukarasi
T Zhang
TE McManus
V Jain
WT Liu
X Xiang
Publication venue: Public Library of Science
Publication date: 09/10/2009
Field of study

The human respiratory tract is constantly exposed to a wide variety of viruses, microbes and inorganic particulates from environmental air, water and food. Physical characteristics of inhaled particles and airway mucosal immunity determine which viruses and microbes will persist in the airways. Here we present the first metagenomic study of DNA viral communities in the airways of diseased and non-diseased individuals. We obtained sequences from sputum DNA viral communities in 5 individuals with cystic fibrosis (CF) and 5 individuals without the disease. Overall, diversity of viruses in the airways was low, with an average richness of 175 distinct viral genotypes. The majority of viral diversity was uncharacterized. CF phage communities were highly similar to each other, whereas Non-CF individuals had more distinct phage communities, which may reflect organisms in inhaled air. CF eukaryotic viral communities were dominated by a few viruses, including human herpesviruses and retroviruses. Functional metagenomics showed that all Non-CF viromes were similar, and that CF viromes were enriched in aromatic amino acid metabolism. The CF metagenomes occupied two different metabolic states, probably reflecting different disease states. There was one outlying CF virome which was characterized by an over-representation of Guanosine-5′-triphosphate,3′-diphosphate pyrophosphatase, an enzyme involved in the bacterial stringent response. Unique environments like the CF airway can drive functional adaptations, leading to shifts in metabolic profiles. These results have important clinical implications for CF, indicating that therapeutic measures may be more effective if used to change the respiratory environment, as opposed to shifting the taxonomic composition of resident microbiota

Public Library of Science (PLOS)

Crossref

PubMed Central

The GAAS Metagenomic Tool and Its Estimations of Viral and Microbial Average Genome Size in Four Major Biomes

Author: AC Paoletti
Alejandra Prieto-Davó
B Diez
B Zybailov
Baoli Zhu
Beltran Rodriguez-Mueller
C Desnues
Christelle Desnues
D Rasko
D Willner
Dana Willner
David L. Kirchman
DH Huson
Dionysios A. Antonopoulos
DL Wheeler
EA Dinsdale
EA Dinsdale
Egbert Mundt
Elizabeth A. Dinsdale
F Angly
F Meyer
F Rohwer
FE Angly
Florent E. Angly
FM Lauro
Folker Meyer
Forest Rohwer
Gary D. Stormo
GF Steward
I Hewson
I Letunic
J Raes
J Raes
JAG Ranea
John D. McPherson
K Holmfeldt
K Rosario
Katie Barott
KE Wommack
KE Wommack
KT Konstantinidis
L Florens
LB Koski
Linda Wegley
Lixin Zhang
LM Graves
M Dyall-Smith
M Pignatelli
Matthew Haynes
Matthew R. Henn
Matthew T. Cottrell
MG Weinbauer
Mike Furlan
P DasSarma
P Hugenholtz
R Sadreyev
R Sandaa
R Sandaa
R Seshadri
R. Michael Miller
Rebecca Vega-Thurber
Rick Stevens
RL Vega Thurber
Robert A. Edwards
Robert K. Naviaux
Robert Schmieder
RV Thurber
S Karlin
SD Bentley
SF Altschul
Tracey McDole
Yongfei Hu
Publication venue: Public Library of Science
Publication date: 01/01/2009
Field of study

Metagenomic studies characterize both the composition and diversity of uncultured viral and microbial communities. BLAST-based comparisons have typically been used for such analyses; however, sampling biases, high percentages of unknown sequences, and the use of arbitrary thresholds to find significant similarities can decrease the accuracy and validity of estimates. Here, we present Genome relative Abundance and Average Size (GAAS), a complete software package that provides improved estimates of community composition and average genome length for metagenomes in both textual and graphical formats. GAAS implements a novel methodology to control for sampling bias via length normalization, to adjust for multiple BLAST similarities by similarity weighting, and to select significant similarities using relative alignment lengths. In benchmark tests, the GAAS method was robust to both high percentages of unknown sequences and to variations in metagenomic sequence read lengths. Re-analysis of the Sargasso Sea virome using GAAS indicated that standard methodologies for metagenomic analysis may dramatically underestimate the abundance and importance of organisms with small genomes in environmental systems. Using GAAS, we conducted a meta-analysis of microbial and viral average genome lengths in over 150 metagenomes from four biomes to determine whether genome lengths vary consistently between and within biomes, and between microbial and viral communities from the same environment. Significant differences between biomes and within aquatic sub-biomes (oceans, hypersaline systems, freshwater, and microbialites) suggested that average genome length is a fundamental property of environments driven by factors at the sub-biome level. The behavior of paired viral and microbial metagenomes from the same environment indicated that microbial and viral average genome sizes are independent of each other, but indicative of community responses to stressors and environmental conditions

HAL AMU

Directory of Open Access Journals

HAL Descartes

DigitalCommons@Florida International University

Hal-Diderot

University of Queensland eSpace

Public Library of Science (PLOS)

eScholarship - University of California

ScholarlyCommons@Penn

Bacterial Genomes: Habitat Specificity and Uncharted Organisms

Author: A Bernal
C Pedrós-Alió
D Wu
EA Dinsdale
FE Angly
Fernando Dini Andreote
Francisco Dini-Andreote
GR Burke
H Toh
J Raes
JA Gilbert
Jack T. Trevors
JAG Ranea
Jan Dirk van Elsas
JE Barrick
JK Harris
JT Trevors
L Oksana
L Philippot
M Touchon
M Wagner
ML Sogin
NR Pace
P Lapierre
P Yilmaz
PKH Lee
RT Jones
S Abby
SG Tringe
T Ishoey
T Woyke
TM Vogel
Welington Luiz Araújo
Publication venue: Springer-Verlag
Publication date: 01/01/2012
Field of study

The capability and speed in generating genomic data have increased profoundly since the release of the draft human genome in 2000. Additionally, sequencing costs have continued to plummet as the next generation of highly efficient sequencing technologies (next-generation sequencing) became available and commercial facilities promote market competition. However, new challenges have emerged as researchers attempt to efficiently process the massive amounts of sequence data being generated. First, the described genome sequences are unequally distributed among the branches of bacterial life and, second, bacterial pan-genomes are often not considered when setting aims for sequencing projects. Here, we propose that scientists should be concerned with attaining an improved equal representation of most of the bacterial tree of life organisms, at the genomic level. Moreover, they should take into account the natural variation that is often observed within bacterial species and the role of the often changing surrounding environment and natural selection pressures, which is central to bacterial speciation and genome evolution. Not only will such efforts contribute to our overall understanding of the microbial diversity extant in ecosystems as well as the structuring of the extant genomes, but they will also facilitate the development of better methods for (meta)genome annotation

Crossref

Proceedings - University of Groningen

University of Groningen

Springer - Publisher Connector

ARTS repository - University of Groningen

PubMed Central

Dissertations of the University of Groningen