Data hosting infrastructure for primary biodiversity data

Author: A Güntsch
Anthony Goddard
C Bennett-McNew
C Zins
CC Harvey
DJ Patterson
G Hodge
G Yamashita
Grant Yamashita
J Dean
J Gray
J Howe
J Hunter
J Klump
MC Whitlock
MGI Langille
Nathan Wilson
P Leach
P-Y Hsueh
PB Heidorn
Phil Cryer
R Pyle
RJ Scholes
T Berners-Lee
V Smith
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2011

© The Author(s), 2011. This article is distributed under the terms of the Creative Commons Attribution License. The definitive version was published in BMC Bioinformatics 12 Suppl. 15 (2011): S5, doi:10.1186/1471-2105-12-S15-S5. Today, an unprecedented volume of primary biodiversity data is being generated worldwide, yet significant amounts of these data have been and will continue to be lost after the conclusion of the projects tasked with collecting them. To get the most value out of these data, it is imperative to seek a solution whereby they are rescued, archived and made available to the biodiversity community. To this end, the biodiversity informatics community requires investment in processes and infrastructure to mitigate data loss and provide solutions for long-term hosting and sharing of biodiversity data. We review the current state of biodiversity data hosting and investigate the technological and sociological barriers to proper data management. We further explore the rescuing and re-hosting of legacy data and the state of existing toolsets, and propose a future direction for the development of new discovery tools. We also explore the role of data standards and licensing in the context of data hosting and preservation. We provide five recommendations for the biodiversity community that will foster better data preservation and access: (1) encourage the community's use of data standards, (2) promote the public domain licensing of data, (3) establish a community of those involved in data hosting and archival, (4) establish hosting centers for biodiversity data, and (5) develop tools for data discovery. The community's adoption of standards and development of tools to enable data discovery are essential to sustainable data preservation. Furthermore, the increased adoption of open content licensing, the establishment of data hosting infrastructure and the creation of a data hosting and archiving community are all necessary steps towards the community ensuring that data archival policies become standardized.

Woods Hole Open Access Server

Springer - Publisher Connector

Public Library of Science (PLOS)

A Benchmark of Parametric Methods for Horizontal Transfers Detection

Author: A Carbone
A Tsirigos
B Wang
C Dufraigne
C Dutta
C Medigue
C Regeard
Cécile Churlaud
DQ Cortez
E Lerat
G Perriere
H Ochman
J Hacker
J Hacker
J Mrazek
JA Eisen
Jennifer Becq
JG Lawrence
JG Lawrence
JG Lawrence
JP Gogarten
JP Gogarten
L Koski
L Ruiting
M Hamady
M Ip
M Letek
M Poptsova
MA Ragan
MA Ragan
MA Ragan
MGI Langille
MGI Langille
MW van Passel
N Sueoka
N Sueoka
Olivier Neyrolles
P Deschavanne
P Lio
P Lio
Patrick Deschavanne
PJ Deschavanne
Q Tu
R Merkl
R Rolfe
RK Azad
RK Azad
S Garcia-Vallve
S Garcia-Vallvé
S Guindon
S Karlin
S Karlin
S Karlin
S Schjorring
S Waack
SD Hooper
SH Yoon
V Daubin
V Daubin
W Hsiao
WF Doolittle
WS Hayes
Y Nakamura
Publication venue: Public Library of Science
Publication date: 01/04/2010

Horizontal gene transfer (HGT) appears to be important for the evolution of prokaryotic species. As a consequence, numerous parametric methods, using only the information embedded in the genomes, have been designed to detect HGTs. Numerous reports have been published of incongruences between the results of different methods applied to the same genomes. The use of artificial genomes in which all HGT parameters are controlled allows different methods to be tested under the same conditions. The results of this benchmark of 16 representative parametric methods showed a wide range of efficiencies. Some methods work very poorly whatever the type of HGT, and some depend on the conditions or on the metrics used. The best methods in terms of total errors were those using tetranucleotides as the criterion for window methods, or those using codon usage for gene-based methods and the Kullback-Leibler divergence metric. Window methods are very sensitive but less specific, and they detect lone isolated genes poorly. Gene-based methods, on the other hand, are often very specific but lack sensitivity. We propose using two methods in combination to get the best of each category: a gene-based one for specificity and a window-based one for sensitivity.
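To make the idea of a window-based parametric method concrete, the following is a minimal sketch, not any of the 16 benchmarked implementations. It scores each window of a genome by the Kullback-Leibler divergence between the window's tetranucleotide frequencies and those of the whole genome. Pairing tetranucleotides with this metric, the function names, the window and step sizes, and the pseudocount smoothing are all assumptions made for illustration.

```python
from collections import Counter
from itertools import product
from math import log


def kmer_freqs(seq, k=4, pseudocount=1.0):
    """Pseudocount-smoothed k-mer frequencies over the ACGT alphabet.

    The pseudocount (an assumption of this sketch) keeps every frequency
    positive so the divergence below is always defined.
    """
    kmers = ["".join(p) for p in product("ACGT", repeat=k)]
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    total = sum(counts[m] for m in kmers) + pseudocount * len(kmers)
    return {m: (counts[m] + pseudocount) / total for m in kmers}


def kl_divergence(p, q):
    """Kullback-Leibler divergence D(p || q) for two frequency dicts."""
    return sum(p[m] * log(p[m] / q[m]) for m in p)


def window_scores(genome, window=5000, step=2500, k=4):
    """Score each window against the whole-genome k-mer background.

    Returns (start, divergence) pairs; a high divergence marks a window
    whose composition is atypical for the genome.
    """
    background = kmer_freqs(genome, k)
    scores = []
    for start in range(0, max(len(genome) - window, 0) + 1, step):
        win = genome[start:start + window]
        scores.append((start, kl_divergence(kmer_freqs(win, k), background)))
    return scores
```

In practice a threshold on the score would decide which windows are reported as candidate transfers. How that threshold is chosen is left open here.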

Public Library of Science (PLOS)

PIPS: Pathogenicity Island Prediction Software

The adaptability of pathogenic bacteria to hosts is influenced by the genomic plasticity of the bacteria, which can be increased by such mechanisms as horizontal gene transfer. Pathogenicity islands play a major role in this type of gene transfer because they are large, horizontally acquired regions that harbor clusters of virulence genes that mediate the adhesion, colonization, invasion, immune system evasion, and toxigenic properties of the acceptor organism. Currently, pathogenicity islands are mainly identified in silico based on various characteristic features: (1) deviations in codon usage, G+C content or dinucleotide frequency and (2) insertion sequences and/or tRNA genetic flanking regions together with transposase coding genes. Several computational techniques for identifying pathogenicity islands exist. However, most of these techniques are only directed at the detection of horizontally transferred genes and/or the absence of certain genomic regions of the pathogenic bacterium in closely related non-pathogenic species. Here, we present a novel software suite designed for the prediction of pathogenicity islands (pathogenicity island prediction software, or PIPS). In contrast to other existing tools, our approach is capable of utilizing multiple features for pathogenicity island detection in an integrative manner. We show that PIPS provides better accuracy than other available software packages. As an example, we used PIPS to study the veterinary pathogen Corynebacterium pseudotuberculosis, in which we identified seven putative pathogenicity islands
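One of the compositional features named above, deviation in G+C content, can be illustrated with a short sketch. This is not the PIPS implementation. It only computes, for each gene, the difference between its G+C fraction and that of the whole genome. The function names and the input format (a dict from gene name to sequence) are hypothetical.

```python
def gc_content(seq):
    """Fraction of G and C among the A, C, G, T bases of a sequence."""
    seq = seq.upper()
    gc = seq.count("G") + seq.count("C")
    acgt = gc + seq.count("A") + seq.count("T")
    return gc / acgt if acgt else 0.0


def gc_deviation(genes, genome):
    """Map each gene name to GC(gene) - GC(genome).

    `genes` is a dict of gene name -> nucleotide sequence (a hypothetical
    input format chosen for this sketch).
    """
    base = gc_content(genome)
    return {name: gc_content(seq) - base for name, seq in genes.items()}
```

Genes with a large absolute deviation would be one input among several to an integrative predictor, alongside features such as codon usage and flanking tRNA genes.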

Publications at Bielefeld University

MPG.PuRe

Sequence of the hyperplastic genome of the naturally competent Thermus scotoductus SA-01

Author: A Friedrich
A Friedrich
A Friedrich
A Henne
ACE Darling
Antje Wollherr
B Averhoff
B Averhoff
B Ewing
B Ewing
Benjamin Kumwenda
C Bricio
C Cervantes
C Moller
C Nesbo
C Schwarzenlander
C Schwarzenlander
Carlos Bricio
D Chivian
D Mooser
D Slade
Derek Litthauer
DJ Opperman
DJ Opperman
DJ Opperman
DL Balkwill
DP Moser
E van Heerden
Elzbieta Brzuszkiewicz
Esta van Heerden
F Cava
F Cava
G Wanger
Gerhard Gottschalk
GI Omar
H Brüggemann
H Ganesan
Heiko Liesegang
I Narumi
J Janzon
J Mrazek
J Mrazek
JC Fisher
JG Lawrence
JK Bonfield
JK Fredrickson
José Berenguer
JR Lloyd
K Zahradka
Kamini Gounder
KM Handley
KM Handley
L Alvarez
LH Lin
LS Busenlehner
M De Grado
M de Grado
M de la Bastide
M Tanaka
M Wolfgang
M Wolfgang
Malay Srivastava
MGI Langille
MGI Langille
MJ Marshall
MV Omelchenko
N Ohtani
N Saitou
O Bezuidt
Oleg Reva
PA Bester
PD Karp
PD Karp
PD Karp
R Barrangou
RAD Williams
Rolf Daniel
S Graupner
S Karlin
S Silver
S Waack
S Whelan
SF Altschul
SF Altschul
TC Onstott
TL Kieft
TM Gihring
TM Gihring
W Hsiao
Y Agari
Y Koyama
YI Wolf
Publication venue: BioMed Central
Publication date: 01/01/2011

Background: Many strains of Thermus have been isolated from hot environments around the world. Thermus scotoductus SA-01 was isolated from fissure water collected 3.2 km below surface in a South African gold mine. The isolate is capable of dissimilatory iron reduction and of growth with oxygen and nitrate as terminal electron acceptors, and its ability to reduce a variety of metal ions, including gold, chromate and uranium, was demonstrated. The genomes from two different Thermus thermophilus strains have been completed. This paper presents the completed genome of a second Thermus species, T. scotoductus.

Results: The genome of Thermus scotoductus SA-01 consists of a chromosome of 2,346,803 bp and a small plasmid, which together are about 11% larger than the Thermus thermophilus genomes. The T. thermophilus megaplasmid genes are part of the T. scotoductus chromosome, and extensive rearrangement, deletion of nonessential genes and acquisition of gene islands have occurred, leading to a loss of synteny between the chromosomes of T. scotoductus and T. thermophilus. At least nine large inserts, of which seven were identified as alien, were found, the most remarkable being a denitrification cluster and two operons relating to the metabolism of phenolics, which appear to have been acquired from Meiothermus ruber. The majority of acquired genes are from closely related species of the Deinococcus-Thermus group, and many of the remaining genes are from microorganisms with a thermophilic or hyperthermophilic lifestyle. The natural competence of Thermus scotoductus was confirmed experimentally, as expected, since most of the proteins of the natural transformation system of Thermus thermophilus are present. Analysis of the metabolic capabilities revealed an extensive energy metabolism with many aerobic and anaerobic respiratory options. An abundance of sensor histidine kinases, response regulators and transporters for a wide variety of compounds is indicative of an oligotrophic lifestyle.

Conclusions: The genome of Thermus scotoductus SA-01 shows remarkable plasticity, with the loss, acquisition and rearrangement of large portions of its genome compared to Thermus thermophilus. Its ability to naturally take up foreign DNA has helped it adapt rapidly to a subsurface lifestyle in the presence of a dense and diverse population which acted as a source of nutrients. The genome of Thermus scotoductus illustrates how rapid adaptation can be achieved by a highly dynamic and plastic genome.

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Springer - Publisher Connector

National Research Foundation

Digital.CSIC

Biblos-e Archivo

UPSpace at the University of Pretoria

Satellite remote sensing data can be used to model marine microbial metabolite turnover

Author: A Ditchfield
A Toseland
A-B Martin-Cuadrado
AJ Southward
Anton F Post
B Pfeil
BB Jørgensen
Dawn Field
EL Barrett
FO Glöckner
J Ladau
J Yu
JA Fuhrman
JA Gilbert
JA Gilbert
JA Gilbert
JA Gilbert
Jack A Gilbert
JG Caporaso
KA Kilpatrick
KJ Popendorf
KL Carder
M Hügler
M Schmidt
MGI Langille
MJ Follows
N Fierer
N Fierer
NA Kamennaya
NA Kamennaya
Nicole Scott
NM Scott
OU Mason
PE Larsen
PE Larsen
Peter E Larsen
RD Graetz
RJW Brewin
RK Thauer
Rob Knight
S Archer
SC Doney
SM Gibbons
TJ Smyth
VA Smith
W Paul Bissett
X Wang
Yuki Hamada
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 29/07/2014

Sampling ecosystems, even at a local scale, at the temporal and spatial resolution necessary to capture natural variability in microbial communities is prohibitively expensive. We extrapolated marine surface microbial community structure and metabolic potential from 72 16S rRNA amplicon and 8 metagenomic observations using remotely sensed environmental parameters to create a system-scale model of marine microbial metabolism for 5904 grid cells (49 km²) in the Western English Channel, across 3 years of weekly averages. Thirteen environmental variables predicted the relative abundance of 24 bacterial Orders and 1715 unique enzyme-encoding genes that encode turnover of 2893 metabolites. The genes' predicted relative abundance was highly correlated (Pearson correlation 0.72, P-value < 10⁻⁶) with their observed relative abundance in sequenced metagenomes. Predictions of the relative turnover (synthesis or consumption) of CO2 were significantly correlated with observed surface CO2 fugacity. The spatial and temporal variation in the predicted relative abundances of genes coding for cyanase, carbon monoxide and malate dehydrogenase were investigated along with the predicted inter-annual variation in relative consumption or production of ~3000 metabolites forming six significant temporal clusters. These spatiotemporal distributions could possibly be explained by the co-occurrence of anaerobic and aerobic metabolisms associated with localized plankton blooms or sediment resuspension, which facilitate the presence of anaerobic micro-niches. This predictive model provides a general framework for focusing future sampling and experimental design to relate biogeochemical turnover to microbial ecology.

Woods Hole Open Access Server

eScholarship - University of California

NERC Open Research Archive

Heat-treated high-fat diet modifies gut microbiota and metabolic markers in apoe−/− mice

Author: A Manzel
A Rojas
AE Newton
BR Robertson
C Belzer
DJS Mills
E Tareke
Eden Tareke
F Bäckhed
Frida Fåk
G Kolovou
G Vistoli
GM Pasinetti
H Bjorkbacka
H Bukowska
H Vlassara
HY BaY
I Nemet
I Sekirov
J Caporaso
JB Parsons
JH McDonald
JW Baynes
K Shimizu
KR Clarke
M Brisslert
M Kanehisa
MGI Langille
MW Poulsen
N Segata
Nittaya Marungruang
NV Chuyen
P Alexiou
PA Finot
PD Cani
PD Cani
RC Edgar
RE Ley
T DeSantis
T Miyazawa
VR Velagapudi
W Parks Brian
YK Chuah
Z Hegab
Publication venue: 'Springer Science and Business Media LLC'

Public Library of Science (PLOS)

Computational Bacterial Genome-Wide Analysis of Phylogenetic Profiles Reveals Potential Virulence Genes of Streptococcus agalactiae

Author: A Casadevall
A Casadevall
A Hirsh
A Johri
A Kadioglu
A Perrin
B Cantarel
C Franken
C Phares
D Fredericks
D Raskin
E Marcotte
Enrico Coiera
F Lin
Fanrong Kong
Frank Po-Yen Lin
G Soong
Gwendolyn L. Gilbert
H Tettelin
H Tettelin
Herman Tse
I Sutcliffe
I Witten
J Claverys
J Hotopp
J Vert
J Wu
K Doran
M Bokarewa
M Jedrzejas
M Kanehisa
M Pellegrini
M Rhem
M Van Dyke
MGI Langille
N Salama
P Glaser
PM Bowers
R Gibbs
RD Finn
Ruiting Lan
S Chen
S Clarke
S Falkow
S Rooijakkers
S Schrag
SH Yoon
T Kato
T Wassenaar
Vitali Sintchenko
W Haas
W Haas
Y Xu
Y Yamanishi
Y Zheng
Publication venue: Public Library of Science
Publication date: 01/01/2011

The phylogenetic profile of a gene is a reflection of its evolutionary history and can be defined as the differential presence or absence of a gene in a set of reference genomes. It has been employed to facilitate the prediction of gene functions. However, the hypothesis that the application of this concept can also facilitate the discovery of bacterial virulence factors has not been fully examined. In this paper, we test this hypothesis and report a computational pipeline designed to identify previously unknown bacterial virulence genes using group B streptococcus (GBS) as an example. Phylogenetic profiles of all GBS genes across 467 bacterial reference genomes were determined by candidate-against-all BLAST searches, which were then used to identify candidate virulence genes by machine learning models. Evaluation experiments with known GBS virulence genes suggested good functional and model consistency in cross-validation analyses (areas under ROC curve, 0.80 and 0.98 respectively). Inspection of the top-10 genes in each of the 15 virulence functional groups revealed at least 15 (of 119) homologous genes implicated in virulence in other human pathogens but previously unrecognized as potential virulence genes in GBS. Among these highly-ranked genes, many encode hypothetical proteins with possible roles in GBS virulence. Thus, our approach has led to the identification of a set of genes potentially affecting the virulence potential of GBS, which are potential candidates for further in vitro and in vivo investigations. This computational pipeline can also be extended to in silico analysis of virulence determinants of other bacterial pathogens.
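The definition of a phylogenetic profile given above can be sketched in a few lines. This is an illustration under stated assumptions, not the authors' pipeline. The input format (best BLAST E-value per reference genome), the E-value cutoff of 1e-5, and both function names are hypothetical. The agreement score at the end stands in for the machine learning models the abstract mentions, which are not reproduced here.

```python
from typing import Dict, List, Sequence


def phylogenetic_profile(best_evalues: Dict[str, float],
                         reference_genomes: Sequence[str],
                         max_evalue: float = 1e-5) -> List[int]:
    """Binary presence/absence vector of one gene across reference genomes.

    `best_evalues` maps a genome name to the best BLAST E-value found for
    the gene in that genome (hypothetical input format). A genome with no
    entry, or with a hit weaker than the assumed cutoff, counts as absent.
    """
    return [1 if best_evalues.get(g, float("inf")) <= max_evalue else 0
            for g in reference_genomes]


def profile_agreement(a: Sequence[int], b: Sequence[int]) -> float:
    """Fraction of reference genomes on which two profiles agree."""
    if len(a) != len(b):
        raise ValueError("profiles must cover the same reference genomes")
    if not a:
        return 0.0
    return sum(x == y for x, y in zip(a, b)) / len(a)
```

Stacking one such vector per gene gives a genes-by-genomes matrix that could serve as the feature matrix for a classifier.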

Macquarie University ResearchOnline

UNSWorks

The coral core microbiome identifies rare bacterial taxa as ubiquitous endosymbionts

Author: A Gonzalez
A Sellstedt
A Shade
A Shafquat
AW Thompson
C Cleland
C Rinke
C Robinson
C Roller
Celia Smith
CJ Krediet
D Bulgarelli
D Wangpraseurt
David G Bourne
DS Lundberg
Erika S Woolsey
F Bäckhed
G Rastogi
Gergely Torda
H Daims
H Sanguin
Heather L Spalding
HR Gruber-Vodicka
I Letunic
J Caporaso
J Decelle
JA Russell
Jacqueline L Padilla-Gamiño
Jean-Baptiste Raina
JG Caporaso
JL Sachs
KB Ritchie
L Philippot
Lutz Krause
M McFall-Ngai
Martha Zakrzewski
MGI Langille
ML Sogin
MP Lesser
N Fierer
N Knowlton
OO Lee
Ove Hoegh-Guldberg
Pim Bongaerts
R Hayat
Ruth D Gates
S Kahng
S Sunagawa
T Bayer
TCG Bosch
TCG Bosch
TD Ainsworth
TD Ainsworth
TF Cooper
Thomas Bridge
Tracy D Ainsworth
TZ DeSantis
William Leggat
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2015

© 2015 International Society for Microbial Ecology. All rights reserved. Despite being one of the simplest metazoans, corals harbor some of the most highly diverse and abundant microbial communities. Differentiating core, symbiotic bacteria from this diverse host-associated consortium is essential for characterizing the functional contributions of bacteria but has not been possible yet. Here we characterize the coral core microbiome and demonstrate clear phylogenetic and functional divisions between the micro-scale, niche habitats within the coral host. In doing so, we discover seven distinct bacterial phylotypes that are universal to the core microbiome of coral species, separated by thousands of kilometres of oceans. The two most abundant phylotypes are co-localized specifically with the corals' endosymbiotic algae and symbiont-containing host cells. These bacterial symbioses likely facilitate the success of the dinoflagellate endosymbiosis with corals in diverse environmental regimes.

ResearchOnline@JCU

OPUS - University of Technology Sydney

ResearchOnline at James Cook University