Search CORE

49 research outputs found

Recovering complete and draft population genomes from metagenome datasets

Author: A Bankevich
A Charuvaka
A Ramette
BJ Baker
C Luo
CJ Castelle
CL Dupont
CT Brown
CT Brown
D Earl
D Wu
DD Kang
DD Sommer
DH Haft
DH Parks
DJ Edwards
DR Mende
DR Zerbino
E Georganas
F Vezzi
FA Simão
GJ Dick
GW Tyson
HB Nielsen
I Sharon
J Alneberg
J Pell
J Qin
JA Gilbert
JF Vázquez-Castellanos
JT Simpson
K Mavromatis
K Salikhov
KC Wrighton
KM Handley
KR Bradnam
LM Rodriguez-R
LM Rodriguez-R
M Albertsen
M Botzman
M Eppinger
M Hess
M Hunt
M Imelfort
M Ofek-Lalzar
M Pignatelli
M Punta
M Roller
M Scholz
M Wu
M Wu
MC Wendl
MJ Morowitz
N Sangwan
OU Nalbantoglu
PSG Chain
R Ghai
R Luo
R Mackelprang
R Suzuki
R Vicedomini
RS Kantor
S Akhter
S Boisvert
S Boisvert
S Heilbronner
S Koren
SC Clark
SC Rienzi Di
SL Salzberg
SM Gibbons
T Davidsen
T Namiki
TJ Treangen
V Iverson
X Deng
X Huang
Y Kodama
Y Peng
Y-W Wu
Z Zhang
Z-S Hua
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 08/03/2016
Field of study

Assembly of metagenomic sequence data into microbial genomes is of fundamental value to improving our understanding of microbial ecology and metabolism by elucidating the functional potential of hard-to-culture microorganisms. Here, we provide a synthesis of available methods to bin metagenomic contigs into species-level groups and highlight how genetic diversity, sequencing depth, and coverage influence binning success. Despite the computational cost on application to deeply sequenced complex metagenomes (e.g., soil), covarying patterns of contig coverage across multiple datasets significantly improves the binning process. We also discuss and compare current genome validation methods and reveal how these methods tackle the problem of chimeric genome bins i.e., sequences from multiple species. Finally, we explore how population genome assembly can be used to uncover biogeographic trends and to characterize the effect of in situ functional constraints on the genome-wide evolution

Crossref

Woods Hole Open Access Server

Springer - Publisher Connector

PubMed Central

eScholarship - University of California

Evaluating the Fidelity of De Novo Short Read Metagenomic Assembly Using Simulated Data

Author: A Brady
A Charuvaka
A Lopez-Bueno
AE Darling
Andrés Moya
D Hernandez
DB Jaffe
DB Rusch
DC Richter
DD Sommer
DH Haft
DH Huson
DR Zerbino
EW Myers
F Meyer
GG Sutton
GW Tyson
I Letunic
I Maccallum
J Laserson
J Qin
JA Huber
JC Dohm
JC Dohm
JC Wooley
JO Korbel
Jonathan H. Badger
JR Miller
JR Miller
JT Simpson
K Liolios
K Mavromatis
KE Wommack
L Krause
M de la Bastide
M Margulies
M Pop
M Stark
M Wu
Miguel Pignatelli
MJ Chaisson
NN Diaz
OU Nalbantoglu
PJ Turnbaugh
PJ Turnbaugh
R Li
R Seshadri
RD Finn
RL Tatusov
RL Warren
S Batzoglou
S Levy
S Yooseph
SM Huse
SR Gill
T Schoenfeld
TS Ghosh
VM Markowitz
WJ Kent
WR Jeck
X Huang
X Huang
Y Ye
Publication venue: Public Library of Science
Publication date: 23/05/2011
Field of study

A frequent step in metagenomic data analysis comprises the assembly of the sequenced reads. Many assembly tools have been published in the last years targeting data coming from next-generation sequencing (NGS) technologies but these assemblers have not been designed for or tested in multi-genome scenarios that characterize metagenomic studies. Here we provide a critical assessment of current de novo short reads assembly tools in multi-genome scenarios using complex simulated metagenomic data. With this approach we tested the fidelity of different assemblers in metagenomic studies demonstrating that even under the simplest compositions the number of chimeric contigs involving different species is noticeable. We further showed that the assembly process reduces the accuracy of the functional classification of the metagenomic data and that these errors can be overcome raising the coverage of the studied metagenome. The results presented here highlight the particular difficulties that de novo genome assemblers face in multi-genome scenarios demonstrating that these difficulties, that often compromise the functional classification of the analyzed data, can be overcome with a high sequencing effort

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Comparative genomics of the bacterial genus Listeria: Genome evolution is characterized by limited gene acquisition and limited gene loss

Author: A Akya
A Bateman
A Camejo
A Leclercq
ACE Darling
AJ Cummins
AJ Drummond
AM Phillippy
AP Rooney
B González-Zorn
C Buchrieser
C Steinweg
CA Cummings
Craig A Cummings
D Liu
D Volokhov
D Volokhov
DE Waldron
DH Haft
DL Swofford
DR Zerbino
E Gouin
F Ronquist
G Greub
G Lima-Mendez
GG Wilson
H Bierne
H Ochman
H Ochman
H Tettelin
HC den Bakker
Henk C den Bakker
I Grissa
I Karunasagar
J Dorscht
J Felsenstein
J Goris
J Johnson
J Rocourt
J Vázquez-Boland
JA Vazquez-Boland
JD Mcpherson
JR McQuiston
K Katoh
K Nightingale
KA Dunn
KE Nelson
KK Nightingale
L Braun
L Snipen
LA Marraffini
LM Graves
Lovorka Degoricija
M Krzywinski
M Lebrun
M Schmid
M Sebaihia
Manohar R Furtado
Martin Wiedmann
Melissa Barker
MJ Chaisson
MJ Pallen
MP Lessing
MR Tock
MW Gilmour
N Faith
NA Moran
NE Freitag
Olga Petrauskene
ORP Bininda-Emonds
P Boerlin
P Glaser
P Horvath
P Larsson
Paolo Vatta
PSG Chain
R Leplae
R Suzuki
Renato H Orsi
RH Orsi
RH Orsi
RH Orsi
RK Aziz
S Erdenlig
S Götz
S Kumar
S Kurtz
S Machata
S Stuart
S Waack
SF Altschul
T Hain
T Hain
T Rajabian
TD Read
Vania Ferreira
X Didelot
Publication venue: BioMed Central
Publication date: 01/12/2010
Field of study

Abstract Background The bacterial genus <it>Listeria </it>contains pathogenic and non-pathogenic species, including the pathogens <it>L. monocytogenes </it>and <it>L. ivanovii</it>, both of which carry homologous virulence gene clusters such as the <it>prfA </it>cluster and clusters of internalin genes. Initial evidence for multiple deletions of the <it>prfA </it>cluster during the evolution of <it>Listeria </it>indicates that this genus provides an interesting model for studying the evolution of virulence and also presents practical challenges with regard to definition of pathogenic strains. Results To better understand genome evolution and evolution of virulence characteristics in <it>Listeria</it>, we used a next generation sequencing approach to generate draft genomes for seven strains representing <it>Listeria </it>species or clades for which genome sequences were not available. Comparative analyses of these draft genomes and six publicly available genomes, which together represent the main <it>Listeria </it>species, showed evidence for (i) a pangenome with 2,032 core and 2,918 accessory genes identified to date, (ii) a critical role of gene loss events in transition of <it>Listeria </it>species from facultative pathogen to saprotroph, even though a consistent pattern of gene loss seemed to be absent, and a number of isolates representing non-pathogenic species still carried some virulence associated genes, and (iii) divergence of modern pathogenic and non-pathogenic <it>Listeria </it>species and strains, most likely circa 47 million years ago, from a pathogenic common ancestor that contained key virulence genes. Conclusions Genome evolution in <it>Listeria </it>involved limited gene loss and acquisition as supported by (i) a relatively high coverage of the predicted pan-genome by the observed pan-genome, (ii) conserved genome size (between 2.8 and 3.2 Mb), and (iii) a highly syntenic genome. Limited gene loss in <it>Listeria </it>did include loss of virulence associated genes, likely associated with multiple transitions to a saprotrophic lifestyle. The genus <it>Listeria </it>thus provides an example of a group of bacteria that appears to evolve through a loss of virulence rather than acquisition of virulence characteristics. While <it>Listeria </it>includes a number of species-like clades, many of these putative species include clades or strains with atypical virulence associated characteristics. This information will allow for the development of genetic and genomic criteria for pathogenic strains, including development of assays that specifically detect pathogenic <it>Listeria </it>strains.</p

Crossref

Directory of Open Access Journals

PubMed Central

Multiple Data Analyses and Statistical Approaches for Analyzing Data from Metagenomic Studies and Clinical Trials

Author: A Brady
A Oulas
A Pati
A Wilke
AM Bolger
B Broeksema
B Buchfink
B Chevreux
B Lai
B Langmead
B Liu
C Bland
C Lozupone
C Lozupone
C Quast
C Quince
C-KK Chan
CC Laczny
CC Laczny
CJF Terbraak
CS Riesenfeld
D Ai
D Arndt
D Hyatt
DA Benson
DD Kang
DH Haft
DH Huson
DH Huson
DR Zerbino
EJ Richardson
G Bacaro
G Greub
GA Pavlopoulos
GJ Dick
H Hotelling
H Teeling
H Tuomisto
H Watson
H Zheng
H-H Lin
HW Virgin
I Borg
I Gregor
J Alneberg
J Chen
J Jovel
J Ni
J Peterson
J Vollmers
J Wang
JC Lagier
JD Forbes
JE Clarridge
JG Caporaso
JG Caporaso
JL Du
JR Bray
JS Ghurye
K Pearson
K Sato
KG Clarke
KJ Hoff
KP Aßhauer
KR Gabriel
KR Patil
L Kaufman
L Krause
L Smeds
M Hamady
M Imelfort
M Martin
M Rho
M Taylor
M Tessler
MGI Langille
MJ Anderson
MO Hill
MP Cox
NN Diaz
P Xu
PA Pevzner
PD Schloss
PD Schloss
PJ McMurdie
Q Wang
R Development Core Team
R Luo
R McGill
R Overbeek
R Ranjan
R Schmieder
R Staden
RC Edgar
RC Edgar
RC Gentleman
RD Finn
RH Whittaker
RL Rodriguez
RL Tatusov
S Hunter
S Lindgreen
S Mitra
S Mitra
S Powell
SD Jackman
SF Altschul
T Hastie
VM Markowitz
W Zhu
Y Liu
Y Peng
Y Peng
Y Ye
Y Zhang
Y Zheng
Y-W Wu
ZT Yu
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2019
Field of study

Metagenomics, also known as environmental genomics, is the study of the genomic content of a sample of organisms (microbes) obtained from a common habitat. Metagenomics and other “omics” disciplines have captured the attention of researchers for several decades. The effect of microbes in our body is a relevant concern for health studies. There are plenty of studies using metagenomics which examine microorganisms that inhabit niches in the human body, sometimes causing disease, and are often correlated with multiple treatment conditions. No matter from which environment it comes, the analyses are often aimed at determining either the presence or absence of specific species of interest in a given metagenome or comparing the biological diversity and the functional activity of a wider range of microorganisms within their communities. The importance increases for comparison within different environments such as multiple patients with different conditions, multiple drugs, and multiple time points of same treatment or same patient. Thus, no matter how many hypotheses we have, we need a good understanding of genomics, bioinformatics, and statistics to work together to analyze and interpret these datasets in a meaningful way. This chapter provides an overview of different data analyses and statistical approaches (with example scenarios) to analyze metagenomics samples from different medical projects or clinical trials

Crossref

White Rose Research Online

The Complete Genome Sequence of Fibrobacter succinogenes S85 Reveals a Cellulolytic and Metabolic Specialist

Author: A Marchler-Bauer
A Pati
AH Iyo
AH Iyo
B Ewing
B Ewing
BL Cantarel
C Claudel-Renard
C Lin
Cameron R. Currie
CG Moreira
Colleen Drinkwater
CS Han
D Gordon
D Groleau
D Hyatt
David M. Stevenson
David Mead
DB Wilson
DH Haft
DK Kam
DK Kam
DM Stevenson
DR Zerbino
E Griffiths
EA Bayer
EV Koonin
FD Ciccarelli
Frank O. Aylward
FW Paradis
G Maglione
G Xie
Garret Suen
GL Miller
H Kudo
H Matsui
H-S Jun
H-S Jun
H-S Jun
HJ Flint
J Gong
J Gong
J Purushe
Jan Deneke
JB Russell
JB Russell
JE Wells
JK Alexander
JK Alexander
Julie Boyum
K Lagesen
K Ogata
KK Cho
KP McDermid
L Huang
L Montgomery
LM Malburg Jr
LR Lynd
Lynne A. Goodwin
M Kanehisa
M Margulies
M Mitsumori
M Morrison
M Qi
M Qi
ME Berg Miller
MJ McGavin
MS Centeno
N Asanuma
N Ozcan
NA Spiridonov
Natalia Mikhailova
Natalia N. Ivanova
NP Cianciotto
O Emanuelsson
Olga Chertkov
P Brumm
Paul J. Weimer
Phillip J. Brumm
PJ Weimer
PJ Weimer
PJ Weimer
PJ Weimer
PJ Weimer
PM Vignais
R Cavicchioli
RD Finn
RE Hungate
RH Doi
RL Tatusov
RM Teather
RS Gupta
S Bennett
S Hunter
S Yoshida
S Yoshida
S Yoshida
SF Altschul
SO Han
SR Malburg
T Lowe
TL Miller
V Broussolle
W Hashimoto
W Hashimoto
Wenjun Li
WJ Costerton
WJ Kelly
Y Kobayashi
Y Nataf
Y Shi
Y Tamaru
Publication venue: Public Library of Science
Publication date: 19/04/2011
Field of study

Fibrobacter succinogenes is an important member of the rumen microbial community that converts plant biomass into nutrients usable by its host. This bacterium, which is also one of only two cultivated species in its phylum, is an efficient and prolific degrader of cellulose. Specifically, it has a particularly high activity against crystalline cellulose that requires close physical contact with this substrate. However, unlike other known cellulolytic microbes, it does not degrade cellulose using a cellulosome or by producing high extracellular titers of cellulase enzymes. To better understand the biology of F. succinogenes, we sequenced the genome of the type strain S85 to completion. A total of 3,085 open reading frames were predicted from its 3.84 Mbp genome. Analysis of sequences predicted to encode for carbohydrate-degrading enzymes revealed an unusually high number of genes that were classified into 49 different families of glycoside hydrolases, carbohydrate binding modules (CBMs), carbohydrate esterases, and polysaccharide lyases. Of the 31 identified cellulases, none contain CBMs in families 1, 2, and 3, typically associated with crystalline cellulose degradation. Polysaccharide hydrolysis and utilization assays showed that F. succinogenes was able to hydrolyze a number of polysaccharides, but could only utilize the hydrolytic products of cellulose. This suggests that F. succinogenes uses its array of hemicellulose-degrading enzymes to remove hemicelluloses to gain access to cellulose. This is reflected in its genome, as F. succinogenes lacks many of the genes necessary to transport and metabolize the hydrolytic products of non-cellulose polysaccharides. The F. succinogenes genome reveals a bacterium that specializes in cellulose as its sole energy source, and provides insight into a novel strategy for cellulose degradation

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

The Complete Genome Sequence of Thermoproteus tenax: A Physiologically Versatile Member of the Crenarchaeota

Author: A Hiller
A Schramm
A Swiatek
A Veith
Andrea Rosinus
André Plagens
Arnulf Kletzin
B Boeckmann
B Eikmanns
B Ewing
B Linke
B Siebers
Bettina Siebers
Britta Tjaden
C Baar
C Brochier-Armanet
CB Walker
Cecile Fairhead
CH Verhees
Christa Lanz
D Gordon
D Gordon
DG Ahn
DH Haft
DJ Naether
DR Smith
E Waters
ER Barry
EV Koonin
EV Koonin
F Fischer
F Li
F Meyer
F Sanger
F Werner
Fabian Blombach
Guenter Raddatz
H Huber
H Neumann
Hans-Peter Klenk
HP Klenk
I Anderson
I Orita
I Orita
J Easter Jr
J Van der Oost
JG Elkins
JG Elkins
JH Badger
JN Reeve
K Julenius
K Lewalter
K Liolios
KD Pruitt
Kira S. Makarova
KS Makarova
KS Makarova
KS Makarova
L Aravind
L Craig
LA Marraffini
LA Sazanov
M Csurös
M Eppinger
M Graupner
M Haering
M Kanehisa
M Selig
M Sumper
M Tsubaki
M Vaupel
M Zaparty
M Zaparty
Markus Rampp
Mathias von Jan
Melanie Zaparty
MGL Elferink
MT Facciotti
N Marinsek
N Yutin
Nikos Kyrpides
NP Robinson
NS Baliga
O Emanuelsson
P Rice
P Yarza
PP Chan
PP Gardner
Q Ren
R Barrangou
R Dirmeier
R Hedderich
R Jansen
RD Fleischmann
Reinhard Hensel
RH White
RK Lillestol
RL Tatusov
RW Rose
RY Samson
S Higuchi
S Kurtz
S Laska
S Paytubi
S Schaefer
S Tsoka
SA Qureshi
SD Bell
SD Bell
SF Altschul
SJ Hallam
SJJ Brouns
SL Salzberg
Sonja-Verena Albers
ST Fitz-Gibbon
Stephan C. Schuster
Steve D. Bell
SV Albers
SV Albers
T Coenye
T Lowe
T Mogi
T Soderberg
TL Born
TM Bandeiras
TM Zabriskie
U Jahn
V Müller
VM Markowitz
W Baumeister
W Wildhaber
W Zillig
WH Ramos-Vera
WH Ramos-Vera
X Luo
YM Drozdowicz
YM Drozdowicz
Z Szabó
Publication venue: PUBLIC LIBRARY SCIENCE
Publication date: 01/01/2011
Field of study

Here, we report on the complete genome sequence of the hyperthermophilic Crenarchaeum Thermoproteus tenax (strain Kra 1, DSM 2078(T)) a type strain of the crenarchaeotal order Thermoproteales. Its circular 1.84-megabase genome harbors no extrachromosomal elements and 2,051 open reading frames are identified, covering 90.6% of the complete sequence, which represents a high coding density. Derived from the gene content, T. tenax is a representative member of the Crenarchaeota. The organism is strictly anaerobic and sulfur-dependent with optimal growth at 86 degrees C and pH 5.6. One particular feature is the great metabolic versatility, which is not accompanied by a distinct increase of genome size or information density as compared to other Crenarchaeota. T. tenax is able to grow chemolithoautotrophically (CO2/H-2) as well as chemoorganoheterotrophically in presence of various organic substrates. All pathways for synthesizing the 20 proteinogenic amino acids are present. In addition, two presumably complete gene sets for NADH:quinone oxidoreductase (complex I) were identified in the genome and there is evidence that either NADH or reduced ferredoxin might serve as electron donor. Beside the typical archaeal A(0)A(1)-ATP synthase, a membrane-bound pyrophosphatase is found, which might contribute to energy conservation. Surprisingly, all genes required for dissimilatory sulfate reduction are present, which is confirmed by growth experiments. Mentionable is furthermore, the presence of two proteins (ParA family ATPase, actin-like protein) that might be involved in cell division in Thermoproteales, where the ESCRT system is absent, and of genes involved in genetic competence (DprA, ComF) that is so far unique within Archaea

Directory of Open Access Journals

Wageningen University & Research Publications

MPG.PuRe

CiteSeerX

Public Library of Science (PLOS)

TUbiblio

University of Regensburg Publication Server

Crossref

UCL Discovery

PubMed Central

Oxford University Research Archive

Genome Characterization of the Oleaginous Fungus Mortierella alpina

Author: A Ando
A Chang
A Goffeau
AP Simopoulos
Arthur J. Lustig
B Brugger
B Cresnar
B Dujon
Baixi Zhang
BJ Haas
BJ Haas
BJ Loftus
BJ Pettus
BR Braun
C Ratledge
C Ratledge
C Ratledge
CA Cuomo
CH Wu
Colin Ratledge
D Gordon
DH Haft
DR Scannell
DR Zerbino
DS Hibbett
E Espagne
E Quevillon
E Sakuradani
E Sakuradani
E Sakuradani
E Sakuradani
E Sakuradani
E Seif
EG Bligh
F Martin
FS Dietrich
G Rouser
H Streekstra
Haiqin Chen
Hao Zhang
HD Jang
HJ Pel
Hongchao Wang
Huanxin Zhang
I Korf
I Letunic
IB Lomakin
IM Berquin
Isabelle M. Berquin
J Amselem
J Bielawski
J Jurka
J Kamper
J Xu
James S. Norris
JE Galagan
JE Galagan
Jiansheng Wu
Junguo Shen
L Li
LD Metcalfe
Lei Wang
LF Thatcher
LJ Ma
Lu Feng
LV Michaelson
M Kanehisa
M Leibundgut
M Machida
M Margulies
M Stanke
MG Murray
Michael J. Thomas
MJ Chaisson
MR Andersen
N Hulo
N Rhind
Na Wang
ND Fedorova
PD Thomas
Peng Du
PK Bajpai
RA Burns
RA Dean
RA Hempenius
RD Finn
RL Tatusov
RL Tatusov
S Griffiths-Jones
S Jenni
S Jenni
SF Altschul
SM Goldberg
Suriguga Wang
SW White
SY Ho
T Maier
T Maier
TK Attwood
TM Lowe
TP Carr
TW Jeffries
V Ter-Hovhannisyan
V Wood
WC Nierman
Wei Chen
Wei Wang
WH Majoros
WS Van Kessel
Xiang Liu
Y Li
Y Zhang
Y Zhang
Yan Ren
Yang Li
Yanlin Yang
Yong Q. Chen
YQ Chen
Yuanda Song
Yun Feng
Zhennan Gu
Publication venue: Public Library of Science
Publication date
Field of study

Mortierella alpina is an oleaginous fungus which can produce lipids accounting for up to 50% of its dry weight in the form of triacylglycerols. It is used commercially for the production of arachidonic acid. Using a combination of high throughput sequencing and lipid profiling, we have assembled the M. alpina genome, mapped its lipogenesis pathway and determined its major lipid species. The 38.38 Mb M. alpina genome shows a high degree of gene duplications. Approximately 50% of its 12,796 gene models, and 60% of genes in the predicted lipogenesis pathway, belong to multigene families. Notably, M. alpina has 18 lipase genes, of which 11 contain the class 2 lipase domain and may share a similar function. M. alpina's fatty acid synthase is a single polypeptide containing all of the catalytic domains required for fatty acid synthesis from acetyl-CoA and malonyl-CoA, whereas in many fungi this enzyme is comprised of two polypeptides. Major lipids were profiled to confirm the products predicted in the lipogenesis pathway. M. alpina produces a complex mixture of glycerolipids, glycerophospholipids and sphingolipids. In contrast, only two major sterol lipids, desmosterol and 24(28)-methylene-cholesterol, were detected. Phylogenetic analysis based on genes involved in lipid metabolism suggests that oleaginous fungi may have acquired their lipogenic capacity during evolution after the divergence of Ascomycota, Basidiomycota, Chytridiomycota and Mucoromycota. Our study provides the first draft genome and comprehensive lipid profile for M. alpina, and lays the foundation for possible genetic engineering of M. alpina to produce higher levels and diverse contents of dietary lipids

Crossref

Directory of Open Access Journals

PubMed Central

InterPro in 2019: improving coverage, classification and access to protein sequence annotations

Author: Attwood TK
Babbitt PC
Blum M
Bork P
Bridge A
Brown SD
Chang H-Y
El-Gebali S
Finn RD
Fraser MI
Gough J
Haft DR
Huang H
Letunic I
Lopez R
Luciani A
Madeira F
Marchler-Bauer A
Mi H
Mitchell AL
Natale DA
Necci M
Nuka G
Orengo C
Pandurangan AP
Paysan-Lafosse T
Pesseat S
Potter SC
Qureshi MA
Rawlings ND
Redaschi N
Richardson LJ
Rivoire C
Salazar GA
Sangrador-Vegas A
Sigrist CJA
Sillitoe I
Sutton GG
Thanki N
Thomas PD
Tosatto SCE
Yong S-Y
Publication venue
Publication date: 06/11/2018
Field of study

The InterPro database (http://www.ebi.ac.uk/interpro/) classifies protein sequences into families and predicts the presence of functionally important domains and sites. Here, we report recent developments with InterPro (version 70.0) and its associated software, including an 18% growth in the size of the database in terms on new InterPro entries, updates to content, the inclusion of an additional entry type, refined modelling of discontinuous domains, and the development of a new programmatic interface and website. These developments extend and enrich the information provided by InterPro, and provide greater flexibility in terms of data access. We also show that InterPro's sequence coverage has kept pace with the growth of UniProtKB, and discuss how our evaluation of residue coverage may help guide future curation activities

UCL Discovery

Grid and Databases: BaSTI as a Practical Integration Example

Author: A Dotter
A Marin-Franch
A Pietrinferni
A Pietrinferni
Adriano Pietrinferni
AW Irwin
AY Potekhin
B Koblitz
C Angulo
C Conroy
C Gallart
CA Iglesias
Claudio Vuerli
D Cordier
DA VandenBerg
DR Alexander
E Laure
F Pasian
Fabio Pasian
G Taffoni
Giuliano Taffoni
I Foster
J Frey
K Karasavvas
M Antonioletti
M Haft
Marco Molinaro
Maurizio Salaris
P Demarque
P Kacsuk
P Manzato
P Manzato
Patrizia Manzato
R Alfieri
S Percival
Santi Cassisi
SK Yi
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

CRISPR - A widespread system that provides acquired resistance against phages in bacteria and archaea

Author: A Bolotin
A Ebihara
A Nakata
B Greve
C Bland
C Pourcel
CA Suttle
CJ Bult
DH Haft
DR Smith
EJ Sontheimer
FJ Mojica
FJ Mojica
FJ Mojica
GJ Hannon
HP Klenk
I Grissa
I Grissa
I Mokrousov
J Kamerbeek
JM Sturino
JS Godde
JT Crawford
K Brudey
KE Nelson
KE Wommack
KS Makarova
KS Makarova
LM Schouls
M Breitbart
M Sebaihia
P Durand
P Viswanathan
Philip Hugenholtz
PM Groenen
PW Hermans
R Barrangou
R Jansen
R Jansen
RA Edwards
RC Edgar
RK Lillestøl
Rotem Sorek
RT DeBoy
TH Tang
TH Tang
V Kunin
Victor Kunin
X Peng
Y Ishino
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/03/2008
Field of study

Arrays of clustered, regularly interspaced short palindromic repeats (CRISPRs) are widespread in the genomes of many bacteria and almost all archaea. These arrays are composed of direct repeats that are separated by similarly sized non-repetitive spacers. CRISPR arrays, together with a group of associated proteins, confer resistance to phages, possibly by an RNA-interference-like mechanism. This Progress discusses the structure and function of this newly recognized antiviral mechanism

Crossref

University of Queensland eSpace