Search CORE

300 research outputs found

ProTISA: a comprehensive resource for translation initiation site annotation in prokaryotic genomes

Author: Aivaliotis
Bailey
Besemer
Chang
Crooks
Frishman
G.-Q. Hu
H. Zhu
Hershberg
Londei
Ma
Moll
Ou
P. Ortet
Poole
Rudd
Sazuka
Starmer
Strohl
Suzek
Torarinsson
Wu
X. Zheng
Y.-F. Yang
Z.-S. She
Publication venue: Oxford University Press
Publication date: 01/01/2008
Field of study

Correct annotation of translation initiation site (TIS) is essential for both experiments and bioinformatics studies of prokaryotic translation initiation mechanism as well as understanding of gene regulation and gene structure. Here we describe a comprehensive database ProTISA, which collects TIS confirmed through a variety of available evidences for prokaryotic genomes, including Swiss-Prot experiments record, literature, conserved domain hits and sequence alignment between orthologous genes. Moreover, by combining the predictions from our recently developed TIS post-processor, ProTISA provides a refined annotation for the public database RefSeq. Furthermore, the database annotates the potential regulatory signals associated with translation initiation at the TIS upstream region. As of July 2007, ProTISA includes 440 microbial genomes with more than 390 000 confirmed TISs. The database is available at http://mech.ctb.pku.edu.cn/protis

Crossref

PubMed Central

Draft Genome Sequence of the Marine Streptomyces sp. Strain PP-C42, Isolated from the Baltic Sea

Author: L. Fan
Y. Liu
Z. Li
H. I. Baumann
K. Kleinschmidt
W. Ye
J. F. Imhoff
M. Kleine
D. Cai
Anderson
Besemer
Blättel
Ceylan
Chor
Gueguen
Ladjama
Lagesen
Marçais
Moran
Nett
Ohnishi
Pathom-Aree
Schattner
Thomas
Wang
Wang
Zerbino
Publication venue: 'American Society for Microbiology'
Publication date: 01/01/2007
Field of study

Streptomyces, a branch of aerobic Gram-positive bacteria represents the largest genus of actinobacteria. The streptomycetes are characterized by a complex secondary metabolism and produce over two-thirds of the clinically used natural antibiotics today. Here we report the draft genome sequence of a Streptomyces strain PP-C42 isolated from the marine environment. A subset of unique genes and gene clusters for diverse secondary metabolites as well as antimicrobial peptides (AMPs) could be identified from the genome, showing great promise as a source for novel bioactive compound

Draft Genome Sequence of the Marine Streptomyces sp. Strain PP-C42, Isolated from the Baltic Sea

Author: Anderson
Besemer
Blättel
Ceylan
Chor
D. Cai
Gueguen
H. I. Baumann
J. F. Imhoff
K. Kleinschmidt
L. Fan
Ladjama
Lagesen
M. Kleine
Marçais
Moran
Nett
Ohnishi
Pathom-Aree
Schattner
Thomas
W. Ye
Wang
Wang
Y. Liu
Z. Li
Zerbino
Publication venue: 'American Society for Microbiology'
Publication date: 01/01/2011
Field of study

OceanRep

Crossref

PubMed Central

The ecology and biogeochemistry of stream biofilms

Author: A Böhme
A Dopheide
A Dopheide
A Frossard
A Hector
Aaron I. Packmann
AM Romaní
AM Romaní
AM Romaní
Anna M. Romani
AW Decho
B Guenet
BC Crump
BJ Cardinale
BJ Cardinale
C Baschien
C Chen
C Freeman
C Ruiz-González
CC Hakenkamp
D Taherzadeh
DJ Van Horn
DL Kirchman
DL Strayer
DR Lyon
E Vignaga
F Boano
F Bärlocher
F Garcia-Pichel
G Lear
G Singer
G Singer
GG Geesey
H Peter
H-C Flemming
HM Nepf
I Buriánková
I Hödl
I Ylla
IW Sutherland
J Downing
J Liu
J Wang
J Wimpenny
JA Vorholt
JB Logue
JBH Martiny
JBH Martiny
JD Drummond
JD Watrous
JJ Beaulieu
JJ Piggott
JK Wey
JR Lawrence
JR Lawrence
JW Costerton
JW Costerton
K Besemer
K Besemer
K Besemer
K Besemer
K Besemer
K Celler
K Drescher
K Wagner
KA Kuehn
Katharina Besemer
L Wilhelm
L Wilhelm
L Zeglin
M Danger
M Loreau
MA Leibold
MA Lock
Mia M. Bengtsson
ML Cadenasso
MM Bengtsson
MR Parsek
N Fierer
O Rendueles
OA Olapade
PA Raymond
PJ Mulholland
PS Stewart
R Araya
R Daniel
R Freimann
R Rusconi
RJ Newton
RL Sinsabaugh
S Arnon
S Findlay
S Jacquet
S Naeem
S Widder
S Woodcock
SE Ziegler
SEG Findlay
SN Merbt
ST Rier
T Neu
TJ Battin
TJ Battin
TJ Battin
TJ Battin
TJ Battin
TK Haack
Tom J. Battin
TRR Pintelon
U Risse-Buhl
W Zhang
X Timoner
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Understanding the impact of antibiotic therapies on the respiratory tract resistome: A novel pooled-template metagenomic sequencing strategy

Author: AM Bolger
B Bushnell
B Jia
BA Brown-Elliott
CK Stover
DJ Serisier
DT Truong
E Catherinot
EA Champion
F El Garch
F Ghanbari
F Martineau
GB Rogers
J Besemer
J Jung
J Wang
JL Burns
K Reddington
L Fu
L Zhang
LJ Sherrard
M Denton
M Kresken
MA Nadkarni
MD Parkins
N Lechtzin
O Bar-On
R Li
RL Hill
S Tristram
SW Long
Y Peng
Z Aktas
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2018
Field of study

Determining the effects of antimicrobial therapies on airway microbiology at a population-level is essential. Such analysis allows, for example, surveillance of antibiotic-induced changes in pathogen prevalence, the emergence and spread of antibiotic resistance, and the transmission of multi-resistant organisms. However, current analytical strategies for understanding these processes are limited. Culture- and PCR-based assays for specific microbes require the a priori selection of targets, while antibiotic sensitivity testing typically provides no insight into either the molecular basis of resistance, or the carriage of resistance determinants by the wider commensal microbiota. Shotgun metagenomic sequencing provides an alternative approach that allows the microbial composition of clinical samples to be described in detail, including the prevalence of resistance genes and virulence traits. While highly informative, the application of metagenomics to large patient cohorts can be prohibitively expensive. Using sputum samples from a randomised placebo-controlled trial of erythromycin in adults with bronchiectasis, we describe a novel, cost-effective strategy for screening patient cohorts for changes in resistance gene prevalence. By combining metagenomic screening of pooled DNA extracts with validatory quantitative PCR-based analysis of candidate markers in individual samples, we identify population-level changes in the relative abundance of specific macrolide resistance genes. This approach has the potential to provide an important adjunct to current analytical strategies, particularly within the context of antimicrobial clinical trials

Crossref

Directory of Open Access Journals

espace@Curtin

Flinders Academic Commons

University of Queensland eSpace

ClustScan: an integrated program package for the semi-automatic annotation of modular biosynthetic gene clusters and in silico prediction of novel chemical structures

Author: Altschul
Antonio Starcevic
Bateman
Bentley
Besemer
Caffrey
Castonguay
Challis
Daslav Hranueli
Del Vecchio
Delcher
Eddy
Finking
Fischbach
Haydock
Haydock
Hranueli
Ikeda
Jenke-Kodama
John Cullum
Jurica Simunkovic
Jurica Zucko
Keatinge-Clay
Lau
Long
Minowa
Oliynyk
Paul F. Long
Rausch
Reeves
Reid
Rusch
Starcevic
Starcevic
Tae
Weininger
Weissman
Yadav
Yadav
Zazopoulos
Zotchev
Zucko
Publication venue: Oxford University Press
Publication date: 01/01/2008
Field of study

The program package ‘ClustScan’ (Cluster Scanner) is designed for rapid, semi-automatic, annotation of DNA sequences encoding modular biosynthetic enzymes including polyketide synthases (PKS), non-ribosomal peptide synthetases (NRPS) and hybrid (PKS/NRPS) enzymes. The program displays the predicted chemical structures of products as well as allowing export of the structures in a standard format for analyses with other programs. Recent advances in understanding of enzyme function are incorporated to make knowledge-based predictions about the stereochemistry of products. The program structure allows easy incorporation of additional knowledge about domain specificities and function. The results of analyses are presented to the user in a graphical interface, which also allows easy editing of the predictions to incorporate user experience. The versatility of this program package has been demonstrated by annotating biochemical pathways in microbial, invertebrate animal and metagenomic datasets. The speed and convenience of the package allows the annotation of all PKS and NRPS clusters in a complete Actinobacteria genome in 2–3 man hours. The open architecture of ClustScan allows easy integration with other programs, facilitating further analyses of results, which is useful for a broad range of researchers in the chemical and biological sciences

Crossref

PubMed Central

UCL Discovery

King's Research Portal

VIGOR, an annotation program for small viral genomes

Author: A Gradi
AC Palmenberg
AL Kistler
DA Steinhauer
David Spiro
DJ Deming
F Dos Ramos
FB Guo
J Besemer
J Matthijnssens
J Zhang
J Ziebuhr
Jaideep P Sundaram
KG Nicholson
KP Alekseev
L Kiemer
LL Chen
M Borodovsky
M Hasoksuz
MK Estes
R Mills
R Sharma
RL Graham
SF Altschul
Shiliang Wang
SM McDonald
SM Mount
T Horimoto
W Chen
W Li
X Zhang
X Zhang
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Abstract Background The decrease in cost for sequencing and improvement in technologies has made it easier and more common for the re-sequencing of large genomes as well as parallel sequencing of small genomes. It is possible to completely sequence a small genome within days and this increases the number of publicly available genomes. Among the types of genomes being rapidly sequenced are those of microbial and viral genomes responsible for infectious diseases. However, accurate gene prediction is a challenge that persists for decoding a newly sequenced genome. Therefore, accurate and efficient gene prediction programs are highly desired for rapid and cost effective surveillance of RNA viruses through full genome sequencing. Results We have developed VIGOR (Viral Genome ORF Reader), a web application tool for gene prediction in influenza virus, rotavirus, rhinovirus and coronavirus subtypes. VIGOR detects protein coding regions based on sequence similarity searches and can accurately detect genome specific features such as frame shifts, overlapping genes, embedded genes, and can predict mature peptides within the context of a single polypeptide open reading frame. Genotyping capability for influenza and rotavirus is built into the program. We compared VIGOR to previously described gene prediction programs, ZCURVE_V, GeneMarkS and FLAN. The specificity and sensitivity of VIGOR are greater than 99% for the RNA viral genomes tested. Conclusions VIGOR is a user friendly web-based genome annotation program for five different viral agents, influenza, rotavirus, rhinovirus, coronavirus and SARS coronavirus. This is the first gene prediction program for rotavirus and rhinovirus for public access. VIGOR is able to accurately predict protein coding genes for the above five viral types and has the capability to assign function to the predicted open reading frames and genotype influenza virus. The prediction software was designed for performing high throughput annotation and closure validation in a post-sequencing production pipeline.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Gene identification and protein classification in microbial metagenomic sequence data via incremental clustering

Author: A Bateman
A Nekrutenko
AC McHardy
AG Murzin
CA Orengo
D Fischer
DB Rusch
DH Haft
DL Wheeler
E Birney
ED Harrington
EF DeLong
EF DeLong
F Corpet
F Sanger
FMDL Vega
Granger Sutton
GW Tyson
H Noguchi
H Ochman
J Besemer
J Quackenbush
JA Eisen
JC Venter
K Chen
K Mavromatis
L Krause
L Rychlewski
M Margulies
M Sait
N Siew
R Seshadri
R Unger
RC Edgar
S Yooseph
SF Altschul
SF Altschul
SG Tringe
Shibu Yooseph
SJ Giovannoni
SR Gill
W Li
W Li
W Li
Weizhong Li
Z Yang
Z Yang
Publication venue: BioMed Central
Publication date: 01/04/2008
Field of study

Abstract Background The identification and study of proteins from metagenomic datasets can shed light on the roles and interactions of the source organisms in their communities. However, metagenomic datasets are characterized by the presence of organisms with varying GC composition, codon usage biases etc., and consequently gene identification is challenging. The vast amount of sequence data also requires faster protein family classification tools. Results We present a computational improvement to a sequence clustering approach that we developed previously to identify and classify protein coding genes in large microbial metagenomic datasets. The clustering approach can be used to identify protein coding genes in prokaryotes, viruses, and intron-less eukaryotes. The computational improvement is based on an incremental clustering method that does not require the expensive all-against-all compute that was required by the original approach, while still preserving the remote homology detection capabilities. We present evaluations of the clustering approach in protein-coding gene identification and classification, and also present the results of updating the protein clusters from our previous work with recent genomic and metagenomic sequences. The clustering results are available via CAMERA, (http://camera.calit2.net). Conclusion The clustering paradigm is shown to be a very useful tool in the analysis of microbial metagenomic data. The incremental clustering method is shown to be much faster than the original approach in identifying genes, grouping sequences into existing protein families, and also identifying novel families that have multiple members in a metagenomic dataset. These clusters provide a basis for further studies of protein families.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Origin of Saxitoxin Biosynthetic Genes in Cyanobacteria

Author: A Krohn
A Moustafa
A Moustafa
A Stamatakis
AG Murzin
Ahmed Moustafa
CJS Bolch
Debashish Bhattacharya
DL Wheeler
Donald M. Anderson
EA O'Brien
EC Beltran
F Pomati
F. Gerald Plumley
FG Plumley
G Christiansen
G Talavera
G Taroncher-Oldenburg
I Chorus
I Letunic
J Besemer
J Felsenstein
J Felsenstein
J-S Kim
Jason E. Stajich
Jeannette E. Loram
Jeremiah D. Hackett
K Katoh
K Lagesen
K Wilson
KD Pruitt
O Zhaxybayeva
P Uribe
R Kellmann
R Kellmann
R Rowan
RA Laskowski
RK Aziz
S Tavaré
S Whelan
SF Altschul
SP Hawser
TT Nguyen
W Gish
WW Carmichael
Y Sako
Y Shimizu
Publication venue: Public Library of Science
Publication date: 01/01/2009
Field of study

BACKGROUND:Paralytic shellfish poisoning (PSP) is a potentially fatal syndrome associated with the consumption of shellfish that have accumulated saxitoxin (STX). STX is produced by microscopic marine dinoflagellate algae. Little is known about the origin and spread of saxitoxin genes in these under-studied eukaryotes. Fortuitously, some freshwater cyanobacteria also produce STX, providing an ideal model for studying its biosynthesis. Here we focus on saxitoxin-producing cyanobacteria and their non-toxic sisters to elucidate the origin of genes involved in the putative STX biosynthetic pathway. METHODOLOGY/PRINCIPAL FINDINGS:We generated a draft genome assembly of the saxitoxin-producing (STX+) cyanobacterium Anabaena circinalis ACBU02 and searched for 26 candidate saxitoxin-genes (named sxtA to sxtZ) that were recently identified in the toxic strain Cylindrospermopsis raciborskii T3. We also generated a draft assembly of the non-toxic (STX-) sister Anabaena circinalis ACFR02 to aid the identification of saxitoxin-specific genes. Comparative phylogenomic analyses revealed that nine putative STX genes were horizontally transferred from non-cyanobacterial sources, whereas one key gene (sxtA) originated in STX+ cyanobacteria via two independent horizontal transfers followed by fusion. In total, of the 26 candidate saxitoxin-genes, 13 are of cyanobacterial provenance and are monophyletic among the STX+ taxa, four are shared amongst STX+ and STX-cyanobacteria, and the remaining nine genes are specific to STX+ cyanobacteria. CONCLUSIONS/SIGNIFICANCE:Our results provide evidence that the assembly of STX genes in ACBU02 involved multiple HGT events from different sources followed presumably by coordination of the expression of foreign and native genes in the common ancestor of STX+ cyanobacteria. The ability to produce saxitoxin was subsequently lost multiple independent times resulting in a nested relationship of STX+ and STX- strains among Anabaena circinalis strains

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Gene prediction in metagenomic fragments: A large scale machine learning approach

Author: A Lukashin
AL Delcher
BE Suzek
Burkhard Morgenstern
CJ van Rijsbergen
CM Bishop
CS Riesenfeld
D Frishman
DA Benson
DJC MacKay
F Sanger
F Wilcoxon
GW Tyson
H Noguchi
HY Ou
IT Nabney
J Besemer
J Handelsman
JC Venter
K Chen
Katharina J Hoff
KE Rudd
L Krause
M Ronaghi
M Tech
M Tech
Maike Tech
MS Rappe
P Hugenholtz
P Nielson
Peter Meinicke
R Amann
R Daniel
R Daniel
R Development Core Team
RA Edwards
Rolf Daniel
S Altschul
S Voget
SG Tringe
T Hastie
T Jarvie
Thomas Lingner
V Torsvik
VB Bajic
W Streit
Publication venue: BioMed Central
Publication date: 01/04/2008
Field of study

Abstract Background Metagenomics is an approach to the characterization of microbial genomes via the direct isolation of genomic sequences from the environment without prior cultivation. The amount of metagenomic sequence data is growing fast while computational methods for metagenome analysis are still in their infancy. In contrast to genomic sequences of single species, which can usually be assembled and analyzed by many available methods, a large proportion of metagenome data remains as unassembled anonymous sequencing reads. One of the aims of all metagenomic sequencing projects is the identification of novel genes. Short length, for example, Sanger sequencing yields on average 700 bp fragments, and unknown phylogenetic origin of most fragments require approaches to gene prediction that are different from the currently available methods for genomes of single species. In particular, the large size of metagenomic samples requires fast and accurate methods with small numbers of false positive predictions. Results We introduce a novel gene prediction algorithm for metagenomic fragments based on a two-stage machine learning approach. In the first stage, we use linear discriminants for monocodon usage, dicodon usage and translation initiation sites to extract features from DNA sequences. In the second stage, an artificial neural network combines these features with open reading frame length and fragment GC-content to compute the probability that this open reading frame encodes a protein. This probability is used for the classification and scoring of gene candidates. With large scale training, our method provides fast single fragment predictions with good sensitivity and specificity on artificially fragmented genomic DNA. Additionally, this method is able to predict translation initiation sites accurately and distinguishes complete from incomplete genes with high reliability. Conclusion Large scale machine learning methods are well-suited for gene prediction in metagenomic DNA fragments. In particular, the combination of linear discriminants and neural networks is promising and should be considered for integration into metagenomic analysis pipelines. The data sets can be downloaded from the URL provided (see Availability and requirements section).</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central