Search CORE

105 research outputs found

ProPhylo: partial phylogenetic profiling to guide protein family construction and assignment of biological process

Author: CJ Stubben
D Barker
D Barker
D Haft
D Szklarczyk
DA Rodionov
Daniel H Haft
DH Haft
DH Haft
DH Haft
EM Marcotte
F Eckstein
F Enault
GV Glazko
H-Y Ou
J Sun
J Wu
J-P Vert
JAG Ranea
JD Selengut
JD Selengut
JD Selengut
Jeremy D Selengut
L Ferrer
M Csurös
M Huynen
M Pellegrini
MA Huynen
Malay K Basu
MS Gelfand
P Pagel
PM Bowers
PR Kensche
PS Dehal
R Jothi
RL Tatusov
S Briesemeister
S Freilich
SR Eddy
SV Date
SV Date
T Blum
T Gaasterland
T Xu
T Yamada
X Brazzolotto
Y Hong
Y Liu
Y Zhou
Z Jiang
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Crossref

Springer - Publisher Connector

PubMed Central

MicroScope: a platform for microbial genome annotation and comparative genomics

Author: A. Lajus
Almeida
Bairoch
Bairoch
Barbe
Bendtsen
Bocs
Bryson
C. Médigue
C. Scarpelli
Carver
Caspi
Claudel-Renard
Cruveiller
D'A
D. Mornico
D. Roche
D. Vallenet
G. Salvignol
Gardner
Gardy
Gil
Glasner
Hacker
Hubbard
Hunter
Kanehisa
Karp
Klimke
L. Fleury
Lagesen
Lima
Lowe
Marcotte
Markowitz
Markowitz
Matsumoto
Meyer
Overbeek
Overbeek
Overbeek
Pellegrini
Pelletier
Pruitt
S. Cruveiller
S. Engelen
Saier
Salzberg
Sayers
Selengut
Serres
Sonnhammer
Tatusov
Vallenet
Walter
Waterhouse
Winsor
Z. Rouy
Publication venue: Oxford University Press
Publication date: 01/01/2009
Field of study

The initial outcome of genome sequencing is the creation of long text strings written in a four letter alphabet. The role of in silico sequence analysis is to assist biologists in the act of associating biological knowledge with these sequences, allowing investigators to make inferences and predictions that can be tested experimentally. A wide variety of software is available to the scientific community, and can be used to identify genomic objects, before predicting their biological functions. However, only a limited number of biologically interesting features can be revealed from an isolated sequence. Comparative genomics tools, on the other hand, by bringing together the information contained in numerous genomes simultaneously, allow annotators to make inferences based on the idea that evolution and natural selection are central to the definition of all biological processes. We have developed the MicroScope platform in order to offer a web-based framework for the systematic and efficient revision of microbial genome annotation and comparative analysis (http://www.genoscope.cns.fr/agc/microscope). Starting with the description of the flow chart of the annotation processes implemented in the MicroScope pipeline, and the development of traditional and novel microbial annotation and comparative analysis tools, this article emphasizes the essential role of expert annotation as a complement of automatic annotation. Several examples illustrate the use of implemented tools for the review and curation of annotations of both new and publicly available microbial genomes within MicroScope’s rich integrated genome framework. The platform is used as a viewer in order to browse updated annotation information of available microbial genomes (more than 440 organisms to date), and in the context of new annotation projects (117 bacterial genomes). The human expertise gathered in the MicroScope database (about 280,000 independent annotations) contributes to improve the quality of microbial genome annotation, especially for genomes initially analyzed by automatic procedures alone

Crossref

PubMed Central

HAL-CEA

FastBLAST: Homology Relationships for Millions of Proteins

Author: A Marchler-Bauer
AA Schaffer
Adam P. Arkin
BE Suzek
Cecile Fairhead
CH Wu
CM Zmasek
D Wilson
F Pearl
H Mi
I Letunic
JD Selengut
LB Koski
M Remm
MN Price
Morgan N. Price
NJ Mulder
Paramvir S. Dehal
PS Dehal
R Durbin
RD Finn
RL Tatusov
S Yooseph
SF Altschul
W Gish
W Li
Publication venue: Public Library of Science
Publication date: 01/01/2008
Field of study

BackgroundAll-versus-all BLAST, which searches for homologous pairs of sequences in a database of proteins, is used to identify potential orthologs, to find new protein families, and to provide rapid access to these homology relationships. As DNA sequencing accelerates and data sets grow, all-versus-all BLAST has become computationally demanding.Methodology/principal findingsWe present FastBLAST, a heuristic replacement for all-versus-all BLAST that relies on alignments of proteins to known families, obtained from tools such as PSI-BLAST and HMMer. FastBLAST avoids most of the work of all-versus-all BLAST by taking advantage of these alignments and by clustering similar sequences. FastBLAST runs in two stages: the first stage identifies additional families and aligns them, and the second stage quickly identifies the homologs of a query sequence, based on the alignments of the families, before generating pairwise alignments. On 6.53 million proteins from the non-redundant Genbank database ("NR"), FastBLAST identifies new families 25 times faster than all-versus-all BLAST. Once the first stage is completed, FastBLAST identifies homologs for the average query in less than 5 seconds (8.6 times faster than BLAST) and gives nearly identical results. For hits above 70 bits, FastBLAST identifies 98% of the top 3,250 hits per query.Conclusions/significanceFastBLAST enables research groups that do not have supercomputers to analyze large protein sequence data sets. FastBLAST is open source software and is available at http://microbesonline.org/fastblast

Crossref

Directory of Open Access Journals

PubMed Central

eScholarship - University of California

F420H2-Dependent Degradation of Aflatoxin and other Furanocoumarins Is Widespread throughout the Actinomycetales

Author: A Berkessel
A Ciegler
Andrew C. Warden
B Gerratana
C Bornemann
Colin Scott
D Guerra-Lopez
D Hormisch
D Isabelle
E Purwantini
E Purwantini
EL Spence
F Jacobson
F Lynen
G Bashiri
G Bashiri
G Heiss
Gauri V. Lapalikar
I Alexeev
JD Selengut
JF Alberts
JJ Griese
JM Wagacha
JN Pitts
John G. Oakeshott
John R. Battista
JY Chung
K Kakinuma
K Tamura
KP Choi
LMI de Poorter
M Kaneko
M Mack
Matthew C. Taylor
MC Taylor
MR Barnes
OD Teniola
R Brodersen
R Singh
RD Draper
Robyn J. Russell
S Ebert
S Ebert
S Grill
S Otani
S Strickland
T Hamamoto
T Oja
TD Bugg
UH Manjunatha
W Li
Publication venue: Public Library of Science
Publication date: 01/01/2012
Field of study

Two classes of F420-dependent reductases (FDR-A and FDR-B) that can reduce aflatoxins and thereby degrade them have previously been isolated from Mycobacterium smegmatis. One class, the FDR-A enzymes, has up to 100 times more activity than the other. F420 is a cofactor with a low reduction potential that is largely confined to the Actinomycetales and some Archaea and Proteobacteria. We have heterologously expressed ten FDR-A enzymes from diverse Actinomycetales, finding that nine can also use F420H2 to reduce aflatoxin. Thus FDR-As may be responsible for the previously observed degradation of aflatoxin in other Actinomycetales. The one FDR-A enzyme that we found not to reduce aflatoxin belonged to a distinct clade (herein denoted FDR-AA), and our subsequent expression and analysis of seven other FDR-AAs from M. smegmatis found that none could reduce aflatoxin. Certain FDR-A and FDR-B enzymes that could reduce aflatoxin also showed activity with coumarin and three furanocoumarins (angelicin, 8-methoxysporalen and imperatorin), but none of the FDR-AAs tested showed any of these activities. The shared feature of the compounds that were substrates was an α,β-unsaturated lactone moiety. This moiety occurs in a wide variety of otherwise recalcitrant xenobiotics and antibiotics, so the FDR-As and FDR-Bs may have evolved to harness the reducing power of F420 to metabolise such compounds. Mass spectrometry on the products of the FDR-catalyzed reduction of coumarin and the other furanocoumarins shows their spontaneous hydrolysis to multiple products

Crossref

Directory of Open Access Journals

PubMed Central

FigShare

Predicted Relative Metabolomic Turnover (PRMT): determining metabolic turnover from a coastal marine metagenomic dataset

Author: KO Buesseler
Falkowski G Paul
JA Gilbert
S Mitra
JA Gilbert
JG Bundy
MR Viant
MR Viant
C-Y Lin
JC Wooley
KB Heidelberg
JA Gilbert
F Meyer
M Kanehisa
R Overbeek
JD Selengut
TA Gianoulis
D Field
HW Ma
Y Rao
H Petković
FO Glöckner
PM Sivakumar
JY Cho
K Motohashi
A Paytan
VS Mikhail
JH Martin
JH Martin
JH Martin
JH Street
S Blain
JA Gilbert
AN Kulakova
JP Quinn
DJ Repeta
GW Gooday
C Jeuniaux
NO Keyhani
MT Cottrell
MT Cottrell
MT Cottrell
AL Svitil
AJ Southward
H Petković
P Shannon
Publication venue: BioMed Central
Publication date: 01/01/2005
Field of study

We present an approach in which the semantics of an XML language is defined by means of a transformation from an XML document model (an XML schema) to an application specific model. The application specific model implements the intended behavior of documents written in the language. A transformation is specified in a model transformation language used in the Model Driven Architecture (MDA) approach for software development. Our approach provides a better separation of three concerns found in XML applications: syntax, syntax processing logic and intended meaning of the syntax. It frees the developer of low-level syntactical details and improves the adaptability and reusability of XML applications. Declarative transformation rules and the explicit application model provide a finer control over the application parts affected by adaptations. Transformation rules and the application model for an XML language may be composed with the corresponding rules and application models defined for other XML languages. In that way we achieve reuse and composition of XML applications

Queen's University Belfast Research Portal

CiteSeerX

Crossref

Springer - Publisher Connector

PubMed Central

University of Twente Research Information

Predicted Relative Metabolomic Turnover (PRMT): determining metabolic turnover from a coastal marine metagenomic dataset

Author: A Paytan
AJ Southward
AL Svitil
AN Kulakova
C Jeuniaux
C-Y Lin
D Field
DJ Repeta
F Meyer
Falkowski G Paul
FO Glöckner
GW Gooday
H Petković
H Petković
HW Ma
JA Gilbert
JA Gilbert
JA Gilbert
JA Gilbert
JC Wooley
JD Selengut
JG Bundy
JH Martin
JH Martin
JH Martin
JH Street
JP Quinn
JY Cho
K Motohashi
KB Heidelberg
KO Buesseler
M Kanehisa
MR Viant
MR Viant
MT Cottrell
MT Cottrell
MT Cottrell
NO Keyhani
P Shannon
PM Sivakumar
R Overbeek
S Blain
S Mitra
TA Gianoulis
VS Mikhail
Y Rao
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Queen's University Belfast Research Portal

Crossref

Springer - Publisher Connector

PubMed Central

New developments in the InterPro database

InterPro is an integrated resource for protein families, domains and functional sites, which integrates the following protein signature databases: PROSITE, PRINTS, ProDom, Pfam, SMART, TIGRFAMs, PIRSF, SUPERFAMILY, Gene3D and PANTHER. The latter two new member databases have been integrated since the last publication in this journal. There have been several new developments in InterPro, including an additional reading field, new database links, extensions to the web interface and additional match XML files. InterPro has always provided matches to UniProtKB proteins on the website and in the match XML file on the FTP site. Additional matches to proteins in UniParc (UniProt archive) are now available for download in the new match XML files only. The latest InterPro release (13.0) contains more than 13 000 entries, covering over 78% of all proteins in UniProtKB. The database is available for text- and sequence-based searches via a webserver (), and for download by anonymous FTP (). The InterProScan search tool is now also available via a web service at

HAL Descartes

The University of Manchester - Institutional Repository

ProdInra

Archive ouverte UNIGE

CiteSeerX

Crossref

INRIA a CCSD electronic archive server

PubMed Central

UCL Discovery

Oxford University Research Archive

MDC Repository

Explore Bristol Research

Comparative Genomics of Emerging Human Ehrlichiosis Agents

Anaplasma (formerly Ehrlichia) phagocytophilum, Ehrlichia chaffeensis, and Neorickettsia (formerly Ehrlichia) sennetsu are intracellular vector-borne pathogens that cause human ehrlichiosis, an emerging infectious disease. We present the complete genome sequences of these organisms along with comparisons to other organisms in the Rickettsiales order. Ehrlichia spp. and Anaplasma spp. display a unique large expansion of immunodominant outer membrane proteins facilitating antigenic variation. All Rickettsiales have a diminished ability to synthesize amino acids compared to their closest free-living relatives. Unlike members of the Rickettsiaceae family, these pathogenic Anaplasmataceae are capable of making all major vitamins, cofactors, and nucleotides, which could confer a beneficial role in the invertebrate vector or the vertebrate host. Further analysis identified proteins potentially involved in vacuole confinement of the Anaplasmataceae, a life cycle involving a hematophagous vector, vertebrate pathogenesis, human pathogenesis, and lack of transovarial transmission. These discoveries provide significant insights into the biology of these obligate intracellular pathogens

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

eScholarship - University of California

Correction: Comparative Genomics of Emerging Human Ehrlichiosis Agents

Crossref

Directory of Open Access Journals

PubMed Central

CloVR: A virtual machine for automated and portable sequence analysis from the desktop using cloud computing

Author: A Bateman
A Bateman
A Tridgell
Aaron Gussman
AC Stewart
AL Delcher
B Langmead
B Langmead
BE Suzek
C Hemmerich
C Rapier
Cesar Arze
D Field
D Hull
David R Riley
DL Wheeler
DR Zerbino
E Afgan
EE Schadt
F Meyer
J Dean
J Goecks
J Orvis
J White
J White
J White
James R White
JD Selengut
JG Caporaso
JP Mesirov
JR Cole
JR Miller
JR White
JT Dudley
K Galens
K Keahey
K Lagesen
Kevin Galens
LD Stein
M Reich
Mahesh Vangala
Malcolm Matalka
MC Schatz
MC Schatz
MC Schatz
O Trelles
Owen White
PD Schloss
RC Edgar
RK Aziz
RL Tatusov
S Angiuoli
Samuel V Angiuoli
SD Kahn
SF Altschul
SF Altschul
SR Eddy
TM Lowe
W Florian Fricke
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Next-generation sequencing technologies have decentralized sequence acquisition, increasing the demand for new bioinformatics tools that are easy to use, portable across multiple platforms, and scalable for high-throughput applications. Cloud computing platforms provide on-demand access to computing infrastructure over the Internet and can be used in combination with custom built virtual machines to distribute pre-packaged with pre-configured software. We describe the Cloud Virtual Resource, CloVR, a new desktop application for push-button automated sequence analysis that can utilize cloud computing resources. CloVR is implemented as a single portable virtual machine (VM) that provides several automated analysis pipelines for microbial genomics, including 16S, whole genome and metagenome sequence analysis. The CloVR VM runs on a personal computer, utilizes local computer resources and requires minimal installation, addressing key challenges in deploying bioinformatics workflows. In addition CloVR supports use of remote cloud computing resources to improve performance for large-scale sequence processing. In a case study, we demonstrate the use of CloVR to automatically process next-generation sequencing data on multiple cloud computing platforms. The CloVR VM and associated architecture lowers the barrier of entry for utilizing complex analysis protocols on both local single- and multi-core computers and cloud systems for high throughput data processing.https://doi.org/10.1186/1471-2105-12-35

Crossref

Springer - Publisher Connector

PubMed Central

Digital Repository at the University of Maryland