Search CORE

73 research outputs found

VIGOR, an annotation program for small viral genomes

Author: A Gradi
AC Palmenberg
AL Kistler
DA Steinhauer
David Spiro
DJ Deming
F Dos Ramos
FB Guo
J Besemer
J Matthijnssens
J Zhang
J Ziebuhr
Jaideep P Sundaram
KG Nicholson
KP Alekseev
L Kiemer
LL Chen
M Borodovsky
M Hasoksuz
MK Estes
R Mills
R Sharma
RL Graham
SF Altschul
Shiliang Wang
SM McDonald
SM Mount
T Horimoto
W Chen
W Li
X Zhang
X Zhang
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Abstract Background The decrease in cost for sequencing and improvement in technologies has made it easier and more common for the re-sequencing of large genomes as well as parallel sequencing of small genomes. It is possible to completely sequence a small genome within days and this increases the number of publicly available genomes. Among the types of genomes being rapidly sequenced are those of microbial and viral genomes responsible for infectious diseases. However, accurate gene prediction is a challenge that persists for decoding a newly sequenced genome. Therefore, accurate and efficient gene prediction programs are highly desired for rapid and cost effective surveillance of RNA viruses through full genome sequencing. Results We have developed VIGOR (Viral Genome ORF Reader), a web application tool for gene prediction in influenza virus, rotavirus, rhinovirus and coronavirus subtypes. VIGOR detects protein coding regions based on sequence similarity searches and can accurately detect genome specific features such as frame shifts, overlapping genes, embedded genes, and can predict mature peptides within the context of a single polypeptide open reading frame. Genotyping capability for influenza and rotavirus is built into the program. We compared VIGOR to previously described gene prediction programs, ZCURVE_V, GeneMarkS and FLAN. The specificity and sensitivity of VIGOR are greater than 99% for the RNA viral genomes tested. Conclusions VIGOR is a user friendly web-based genome annotation program for five different viral agents, influenza, rotavirus, rhinovirus, coronavirus and SARS coronavirus. This is the first gene prediction program for rotavirus and rhinovirus for public access. VIGOR is able to accurately predict protein coding genes for the above five viral types and has the capability to assign function to the predicted open reading frames and genotype influenza virus. The prediction software was designed for performing high throughput annotation and closure validation in a post-sequencing production pipeline.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Gene identification and protein classification in microbial metagenomic sequence data via incremental clustering

Author: A Bateman
A Nekrutenko
AC McHardy
AG Murzin
CA Orengo
D Fischer
DB Rusch
DH Haft
DL Wheeler
E Birney
ED Harrington
EF DeLong
EF DeLong
F Corpet
F Sanger
FMDL Vega
Granger Sutton
GW Tyson
H Noguchi
H Ochman
J Besemer
J Quackenbush
JA Eisen
JC Venter
K Chen
K Mavromatis
L Krause
L Rychlewski
M Margulies
M Sait
N Siew
R Seshadri
R Unger
RC Edgar
S Yooseph
SF Altschul
SF Altschul
SG Tringe
Shibu Yooseph
SJ Giovannoni
SR Gill
W Li
W Li
W Li
Weizhong Li
Z Yang
Z Yang
Publication venue: BioMed Central
Publication date: 01/04/2008
Field of study

Abstract Background The identification and study of proteins from metagenomic datasets can shed light on the roles and interactions of the source organisms in their communities. However, metagenomic datasets are characterized by the presence of organisms with varying GC composition, codon usage biases etc., and consequently gene identification is challenging. The vast amount of sequence data also requires faster protein family classification tools. Results We present a computational improvement to a sequence clustering approach that we developed previously to identify and classify protein coding genes in large microbial metagenomic datasets. The clustering approach can be used to identify protein coding genes in prokaryotes, viruses, and intron-less eukaryotes. The computational improvement is based on an incremental clustering method that does not require the expensive all-against-all compute that was required by the original approach, while still preserving the remote homology detection capabilities. We present evaluations of the clustering approach in protein-coding gene identification and classification, and also present the results of updating the protein clusters from our previous work with recent genomic and metagenomic sequences. The clustering results are available via CAMERA, (http://camera.calit2.net). Conclusion The clustering paradigm is shown to be a very useful tool in the analysis of microbial metagenomic data. The incremental clustering method is shown to be much faster than the original approach in identifying genes, grouping sequences into existing protein families, and also identifying novel families that have multiple members in a metagenomic dataset. These clusters provide a basis for further studies of protein families.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Complete Sequencing of the blaNDM-1-Positive IncA/C Plasmid from Escherichia coli ST38 Isolate Suggests a Possible Origin from Plant Pathogens

Author: A Carattoli
A Carattoli
A Muga
AB Olson
AC Fluit
AK van der Bij
Akira Hishinuma
B Zheng
D Yong
DR Call
Fumihiko Takeuchi
G Cornaglia
G Peirano
G Peirano
H Bao
H Li
HW Boucher
J Besemer
JM Rolain
JT Simpson
K Bush
K Tamura
KK Kumarasamy
Kunikazu Yamane
L Poirel
M Castanheira
Makoto Kuroda
Makoto Ohnishi
Mari Matsui
MM D'Andrea
MM Rodriguez
ND Grindley
Niyaz Ahmed
O Samuelsen
PA Lund
PB Wyrick
PL Ho
R Canton
RC Moellering Jr
S Chihara
S Suzuki
SF Altschul
T Carver
T Yamamoto
TR Walsh
Tsuyoshi Sekizuka
Y Pfeifer
Yoshichika Arakawa
Publication venue: Public Library of Science
Publication date: 23/09/2011
Field of study

The complete sequence of the plasmid pNDM-1_Dok01 carrying New Delhi metallo-β-lactamase (NDM-1) was determined by whole genome shotgun sequencing using Escherichia coli strain NDM-1_Dok01 (multilocus sequence typing type: ST38) and the transconjugant E. coli DH10B. The plasmid is an IncA/C incompatibility type composed of 225 predicted coding sequences in 195.5 kb and partially shares a sequence with blaCMY-2-positive IncA/C plasmids such as E. coli AR060302 pAR060302 (166.5 kb) and Salmonella enterica serovar Newport pSN254 (176.4 kb). The blaNDM-1 gene in pNDM-1_Dok01 is terminally flanked by two IS903 elements that are distinct from those of the other characterized NDM-1 plasmids, suggesting that the blaNDM-1 gene has been broadly transposed, together with various mobile elements, as a cassette gene. The chaperonin groES and groEL genes were identified in the blaNDM-1-related composite transposon, and phylogenetic analysis and guanine-cytosine content (GC) percentage showed similarities to the homologs of plant pathogens such as Pseudoxanthomonas and Xanthomonas spp., implying that plant pathogens are the potential source of the blaNDM-1 gene. The complete sequence of pNDM-1_Dok01 suggests that the blaNDM-1 gene was acquired by a novel composite transposon on an extensively disseminated IncA/C plasmid and transferred to the E. coli ST38 isolate

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Tm1: A Mutator/Foldback Transposable Element Family in Root-Knot Nematodes

Author: A Hua-Van
A Hugall
A Lomsadze
AC Triantaphyllou
AC Triantaphyllou
AC Triantaphyllou
AF Bird
AJ Drummond
AJ Windsor
AP Feinberg
B McClintock
C Branch
C Cheng
C Feschotte
C Feschotte
C Opperman
C Wang
C Zijlstra
CA Gleason
CP Marquez
D Ghosh
D Lisch
D Lisch
D Lisch
D Rebatchouk
DH Lunt
DJ Finnegan
DJ Lampe
DL Arnold
DL Arnold
DL Trudgill
DM Bird
F Casals
F Chalvet
G Nombela
GR Hicks
H Leroy
H Takami
I Kaloshian
J Besemer
J Sambrook
J Xu
JA Eisen
JC Gilbert
JC Silva
JC Silva
JH Xu
JK Lim
JK Pace 2nd
JL Bennetzen
JN Sasser
JP Semblat
K Nakai
L Haren
L Sinzelle
M Chandler
M Cáceres
M Gause
M Marzo
M Rossi
MA McClure
MG Kidwell
MG Kidwell
MG Kidwell
MM Babu
N Jiang
N Kaplan
P Abad
P Castagnone-Sereno
P Castagnone-Sereno
P Tailliez
PA Mieczkowski
QL Liu
QL Liu
R Chenna
R Dorn
R Keall
R Quintiliani Jr
RD Finn
Richard Cordaux
S Wright
SC Bost
SF Altschul
SF Altschul
SR Wessler
SS Potter
Stephen M. Gross
T Blumenthal
T Takumi
TD Schneider
TO Powers
Valerie M. Williamson
VM Williamson
WF Marzluff
WS Barham
X Diao
Z Xu
Publication venue: Public Library of Science
Publication date: 01/01/2011
Field of study

Three closely related parthenogenetic species of root-knot nematodes, collectively termed the Meloidogyne incognita-group, are economically significant pathogens of diverse crop species. Remarkably, these asexual root-knot nematodes are capable of acquiring heritable changes in virulence even though they lack sexual reproduction and meiotic recombination. Characterization of a near isogenic pair of M. javanica strains differing in response to tomato with the nematode resistance gene Mi-1 showed that the virulent strain carried a deletion spanning a gene called Cg-1. Herein, we present evidence that the Cg-1 gene lies within a member of a novel transposable element family (Tm1; Transposon in Meloidogyne-1). This element family is defined by composite terminal inverted repeats of variable lengths similar to those of Foldback (FB) transposable elements and by 9 bp target site duplications. In M. incognita, Tm1 elements can be classified into three general groups: 1) histone-hairpin motif elements; 2) MITE-like elements; 3) elements encoding a putative transposase. The predicted transposase shows highest similarity to gene products encoded by aphids and mosquitoes and resembles those of the Phantom subclass of the Mutator transposon superfamily. Interestingly, the meiotic, sexually-reproducing root-knot nematode species M. hapla has Tm1 elements with similar inverted repeat termini, but lacks elements with histone hairpin motifs and contains no elements encoding an intact transposase. These Tm1 elements may have impacts on root-knot nematode genomes and contribute to genetic diversity of the asexual species

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Leaderless genes in bacteria: clue to the evolution of translation initiation mechanisms in prokaryotes

Author: A Bolotin
A Henne
A Sola-Landa
AC Kaberdina
AL Delcher
B Chang
BE Moseley
CJ Wu
D Benelli
DE Andreev
E Torarinsson
FD Ciccarelli
Gang-Qing Hu
GE Crooks
GP van Wezel
GQ Hu
GQ Hu
GR Janssen
H Chen
H Nothaft
HJ Hong
HQ Zhu
HQ Zhu
Huaiqiu Zhu
I Moll
J Besemer
J Ma
J Shine
JA Lake
JS Hahn
K Chin
M Brenneis
M Jiang
M Kozak
M Ptashne
M Ventura
MA Larkin
MM Slupska
MN Price
MS Paget
N Tolstrup
NJ Ryding
O Hering
P Dam
P Londei
PA Hoskisson
R Hershberg
RK Shultzaberger
RL Tatusov
S Grill
S Kumar
S Nakagawa
T Sazuka
T Udagawa
T Umeyama
TB Anderson
V Mazurakova
WP Revill
Xiaobin Zheng
Zhen-Su She
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background Shine-Dalgarno (SD) signal has long been viewed as the dominant translation initiation signal in prokaryotes. Recently, leaderless genes, which lack 5'-untranslated regions (5'-UTR) on their mRNAs, have been shown abundant in archaea. However, current large-scale <it>in silico </it>analyses on initiation mechanisms in bacteria are mainly based on the SD-led initiation way, other than the leaderless one. The study of leaderless genes in bacteria remains open, which causes uncertain understanding of translation initiation mechanisms for prokaryotes. Results Here, we study signals in translation initiation regions of all genes over 953 bacterial and 72 archaeal genomes, then make an effort to construct an evolutionary scenario in view of leaderless genes in bacteria. With an algorithm designed to identify multi-signal in upstream regions of genes for a genome, we classify all genes into SD-led, TA-led and atypical genes according to the category of the most probable signal in their upstream sequences. Particularly, occurrence of TA-like signals about 10 bp upstream to translation initiation site (TIS) in bacteria most probably means leaderless genes. Conclusions Our analysis reveals that leaderless genes are totally widespread, although not dominant, in a variety of bacteria. Especially for <it>Actinobacteria </it>and <it>Deinococcus-Thermus</it>, more than twenty percent of genes are leaderless. Analyzed in closely related bacterial genomes, our results imply that the change of translation initiation mechanisms, which happens between the genes deriving from a common ancestor, is linearly dependent on the phylogenetic relationship. Analysis on the macroevolution of leaderless genes further shows that the proportion of leaderless genes in bacteria has a decreasing trend in evolution.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Shotgun sequencing of Yersinia enterocolitica strain W22703 (biotype 2, serotype O:9): genomic evidence for oscillation between invertebrates and mammals

Abstract Background <it>Yersinia enterocolitica </it>strains responsible for mild gastroenteritis in humans are very diverse with respect to their metabolic and virulence properties. Strain W22703 (biotype 2, serotype O:9) was recently identified to possess nematocidal and insecticidal activity. To better understand the relationship between pathogenicity towards insects and humans, we compared the W22703 genome with that of the highly pathogenic strain 8081 (biotype1B; serotype O:8), the only <it>Y. enterocolitica </it>strain sequenced so far. Results We used whole-genome shotgun data to assemble, annotate and analyse the sequence of strain W22703. Numerous factors assumed to contribute to enteric survival and pathogenesis, among them osmoregulated periplasmic glucan, hydrogenases, cobalamin-dependent pathways, iron uptake systems and the <it>Yersinia </it>genome island 1 (YGI-1) involved in tight adherence were identified to be common to the 8081 and W22703 genomes. However, sets of ~550 genes revealed to be specific for each of them in comparison to the other strain. The plasticity zone (PZ) of 142 kb in the W22703 genome carries an ancient flagellar cluster Flg-2 of ~40 kb, but it lacks the pathogenicity island YAPIYe, the secretion system <it>ysa </it>and <it>yts1</it>, and other virulence determinants of the 8081 PZ. Its composition underlines the prominent variability of this genome region and demonstrates its contribution to the higher pathogenicity of biotype 1B strains with respect to W22703. A novel type three secretion system of mosaic structure was found in the genome of W22703 that is absent in the sequenced strains of the human pathogenic <it>Yersinia </it>species, but conserved in the genomes of the apathogenic species. We identified several regions of differences in W22703 that mainly code for transporters, regulators, metabolic pathways, and defence factors. Conclusion The W22703 sequence analysis revealed a genome composition distinct from other pathogenic <it>Yersinia enterocolitica </it>strains, thus contributing novel data to the <it>Y. enterocolitica </it>pan-genome. This study also sheds further light on the strategies of this pathogen to cope with its environments.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Shotgun Sequencing Analysis of Trypanosoma cruzi I Sylvio X10/1 and Comparison with T. cruzi VI CL Brener

Author: A MacLeod
A Pedroso
A Rassi Jr
AC Ivens
AP Jackson
B Zingales
B Zingales
Björn Andersson
C Barnabe
C Barnabe
C Branche
C Sanchez-Guillen Mdel
CA Machado
CR Machado
CR Marinho
CS Peacock
DB Weatherly
Ellen Sherwood
EM de Souza
J Besemer
J Jerlstrom-Hultqvist
J Telleria
J Telleria
JA Dvorak
JC Engel
JD Ramirez
JM Burgos
JM de Freitas
JR Miller
L Augusto-Pinto
L Jenni
LE Fichera
M Aslett
M Berriman
M Margulies
M Postan
M Virreira
M Yeo
MA Miles
MA Miles
MA Miles
Martin S. Llewellyn
MC Elias
MD Chapman
MD Lewis
ME Pereira
MF Bosseno
Michael A. Miles
Michael D. Lewis
MS Llewellyn
MV Cardinal
MW Gaunt
N Vargas
NM El-Sayed
NS Akopyants
O Franzen
OR Bininda-Emonds
Oscar Franzén
Paul Andrew Bates
R Nielsen
RC Ruiz
RL Tarleton
S Brisse
S Kurtz
S Ocana-Mayorga
S Revollo
SF Breniere
SR Eddy
Stephen Ochaya
TJ Carver
V Rougeron
W Li
Z Yang
Publication venue: Public Library of Science
Publication date: 01/01/2011
Field of study

Trypanosoma cruzi is the causative agent of Chagas disease, which affects more than 9 million people in Latin America. We have generated a draft genome sequence of the TcI strain Sylvio X10/1 and compared it to the TcVI reference strain CL Brener to identify lineage-specific features. We found virtually no differences in the core gene content of CL Brener and Sylvio X10/1 by presence/absence analysis, but 6 open reading frames from CL Brener were missing in Sylvio X10/1. Several multicopy gene families, including DGF, mucin, MASP and GP63 were found to contain substantially fewer genes in Sylvio X10/1, based on sequence read estimations. 1,861 small insertion-deletion events and 77,349 nucleotide differences, 23% of which were non-synonymous and associated with radical amino acid changes, further distinguish these two genomes. There were 336 genes indicated as under positive selection, 145 unique to T. cruzi in comparison to T. brucei and Leishmania. This study provides a framework for further comparative analyses of two major T. cruzi lineages and also highlights the need for sequencing more strains to understand fully the genomic composition of this parasite

Public Library of Science (PLOS)

Crossref

LSHTM Research Online

Directory of Open Access Journals

PubMed Central

Enlighten

Capturing Single Cell Genomes of Active Polysaccharide Degraders: An Unexpected Contribution of Verrucomicrobia

Author: A Marchler-Bauer
A Raghunathan
A Stamatakis
AB Boraston
AC Alderkamp
B Biddanda
BH Park
BK Swan
BL Cantarel
Brandon K. Swan
Brian Thompson
C Arnosti
C Arnosti
C Arnosti
C Arnosti
Carol Arnosti
CG Bruckner
Cheryl D. Gleasner
Chien-Chi Lo
Chris J. Detter
CJA Sigrist
D Hoefel
Dashiell E. D. Masland
David M. Brazel
DH Buckley
DJ Lane
DR Zerbino
E Pruesse
EA Bayer
EJ Fleming
EM Rubin
EO Casamayor
F Kiefer
FB Dean
FO Glöckner
G Petroni
G Zwart
Gary Xie
H Teeling
HS Yoon
HW Blanch
J Arnds
J Besemer
J Goris
JA Fuhrman
Jacques Ravel
JF Kennedy
JL Heywood
JM Brulc
JM Gasol
K Zhang
K-J Chin
Kai Ziervogel
Krista G. Reitenga
M Derrien
M Hamady
M Hess
M Martinez-Garcia
M Martinez-Garcia
M Richter
M Scheuermayer
M Sieracki
M Weiss
Manuel Martinez-Garcia
ME Sieracki
Monica Lluesma Gomez
MS Rappe
MT Cottrell
ND Rawlings
Nicole J. Poulton
Patrick S. G. Chain
PB Pope
R Stepanauskas
Ramunas Stepanauskas
RM Weiner
S Freitas
S Kurtz
S Ullrich
Sanaa Ahmed
SF Altschul
T Woyke
T Woyke
VM Markowitz
W Ludwig
Wendy K. Bellows
X Zeng
Publication venue: Public Library of Science
Publication date: 01/01/2012
Field of study

Microbial hydrolysis of polysaccharides is critical to ecosystem functioning and is of great interest in diverse biotechnological applications, such as biofuel production and bioremediation. Here we demonstrate the use of a new, efficient approach to recover genomes of active polysaccharide degraders from natural, complex microbial assemblages, using a combination of fluorescently labeled substrates, fluorescence-activated cell sorting, and single cell genomics. We employed this approach to analyze freshwater and coastal bacterioplankton for degraders of laminarin and xylan, two of the most abundant storage and structural polysaccharides in nature. Our results suggest that a few phylotypes of Verrucomicrobia make a considerable contribution to polysaccharide degradation, although they constituted only a minor fraction of the total microbial community. Genomic sequencing of five cells, representing the most predominant, polysaccharide-active Verrucomicrobia phylotype, revealed significant enrichment in genes encoding a wide spectrum of glycoside hydrolases, sulfatases, peptidases, carbohydrate lyases and esterases, confirming that these organisms were well equipped for the hydrolysis of diverse polysaccharides. Remarkably, this enrichment was on average higher than in the sequenced representatives of Bacteroidetes, which are frequently regarded as highly efficient biopolymer degraders. These findings shed light on the ecological roles of uncultured Verrucomicrobia and suggest specific taxa as promising bioprospecting targets. The employed method offers a powerful tool to rapidly identify and recover discrete genomes of active players in polysaccharide degradation, without the need for cultivation

Repositorio Institucional de la Universidad de Alicante

Public Library of Science (PLOS)

Crossref

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Directory of Open Access Journals

PubMed Central

Carolina Digital Repository

FigShare

Full genome SNP-based phylogenetic analysis reveals the origin and global spread of Brucella melitensis

Author: A Bairoch
A Cloeckaert
A Robinson
AC Darling
AF Kaufmann
AH Kampen
AR Wattam
AR Wattam
AR Wattam
AR Wattam
C Marianelli
Chee-Choong Hoh
DR Franz
E Moreno
F Wang
Faizatul Lela Jafar
G Girault
G Morelli
GL Kay
H Ogata
IC Sam
J Besemer
J Ding
J Godfroid
J Song
J-M Verger
JK Bonfield
JP Huelsenbeck
JT Foster
JT Foster
K Lagesen
Kim-Kee Tan
Kok Wei Lee
L Capasso
L Capasso
L Song
LA Corner
Li-Yen Chang
M Herzberg
M Sayan
MA Al-Garadi
ME Meyer
ME Meyer
MJ Corbel
Mohd Noor Mat Isa
MZ Jama'ayah
NE Lucero
P Schattner
PE Chen
PH Bamaiyi
S Valdezate
Sazaly AbuBakar
SF Altschul
Siti Sarah Nore
SS Elberg
TD Read
V Mick
V Wongphruksasoong
W Liu
W Manosuthi
Wai-Yan Yee
Y Ke
Y Ke
Yung-Chie Tan
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

De novo Assembly of a 40 Mb Eukaryotic Genome from Short Sequence Reads: Sordaria macrospora, a Model Organism for Fungal Morphogenesis

Author: A Blumenstein
A Conesa
A Debets
A Goffeau
A Hamann
A Idnurm
A Krogh
A Storlazzi
A Storlazzi
AC Froehlich
AD van Diepeningen
AJ Griffiths
AJ Powell
AM Waterhouse
B Kunstmann
B Kunstmann
B McClintock
BG Hall
Birgit Knab
BS Margolin
C Rech
CA Cuomo
CM Fraser
CM O'Gorman
CN Dewey
CT Walsh
D Hoffmeister
D van Heemst
D Zickler
D Zickler
D Zickler
DB Archer
DD Perkins
DD Perkins
Denise Zickler
DH Huson
DJ Jacobson
DJ Jacobson
DJ Jacobson
DJ Jacobson
DL Hawksworth
DR Zerbino
DW Lee
E Birney
E Branscomb
E Espagne
E Espagne
EM Zdobnov
EP Nawrocki
Eric Espagne
F Debets
F Graia
F Kempken
F Kempken
F Malagnac
F Ronquist
Frank Kempken
GE Tusnady
GW Beadle
H Linden
H Taquist
HD Osiewacz
Heinz D. Osiewacz
Hsiao-Che Kuo
I Braumann
I Braumann
I Engh
I Engh
I Kaneko
I Korf
Ines Engh
J Besemer
J Jurka
J Kinsey
J Mata
J Purschwitz
J Shendure
J Wu
JA Bieszke
JA Reinhardt
Jason E. Stajich
JD Bendtsen
JD Thompson
JE Galagan
JE Galagan
JE Stajich
JE Stajich
Jens Kamerewerd
JHC Hoge
JJ Coleman
JK Hane
JP Huelsenbeck
JP Rasmussen
K Dementhon
K Esser
K Esser
K Groebe
K Ikeda
KA Borkovich
Karen Halliday
KR Pomraning
Kristina M. Smith
L Li
M Buckley
M Freitag
M Lynch
M Margulies
M Nowrousian
M Nowrousian
M Nowrousian
M Nowrousian
M Nowrousian
M Nowrousian
M Nowrousian
M Nowrousian
M Orbach
M Paoletti
M Stanke
M Stanke
M Walz
M Wu
Meiling Chu
Michael Freitag
Minou Nowrousian
MJ Daboussi
ML Smith
N Fedorova
N Hunter
N Khaldi
N Khaldi
N Mir-Rashed
N Whiteford
NB Averbeck
ND Fedorova
ND Read
ND Read
Nick D. Read
NJ Patron
NL Glass
NP Keller
OC Micali
OM Mylek
P Ballario
P Cortesi
P Horton
Paul M. Richardson
PK Shiu
PKT Shiu
PS Schnable
Q He
Q Liu
R Engels
R Li
R Page
RA Dean
RD Finn
RJ Cox
RW Harding
S DiGuistini
S Garcia-Vallvé
S Kroken
S Masloff
S Pöggeler
S Pöggeler
S Pöggeler
S Pöggeler
S Pöggeler
S Pöggeler
S Pöggeler
S Pöggeler
S Sarkar
S Saupe
SB Malik
SE Smith
SF Altschul
SL Page
SR Eddy
Stefanie Pöggeler
Stephan Seiler
T Kasuga
TL Friesen
TM Lowe
U Kück
U Kück
U Schlecht
UL Rosewich
Ulrich Kück
V Fulci
VV Kapitonov
YJ Liu
Publication venue: Public Library of Science
Publication date: 01/01/2010
Field of study

Filamentous fungi are of great importance in ecology, agriculture, medicine, and biotechnology. Thus, it is not surprising that genomes for more than 100 filamentous fungi have been sequenced, most of them by Sanger sequencing. While next-generation sequencing techniques have revolutionized genome resequencing, e.g. for strain comparisons, genetic mapping, or transcriptome and ChIP analyses, de novo assembly of eukaryotic genomes still presents significant hurdles, because of their large size and stretches of repetitive sequences. Filamentous fungi contain few repetitive regions in their 30–90 Mb genomes and thus are suitable candidates to test de novo genome assembly from short sequence reads. Here, we present a high-quality draft sequence of the Sordaria macrospora genome that was obtained by a combination of Illumina/Solexa and Roche/454 sequencing. Paired-end Solexa sequencing of genomic DNA to 85-fold coverage and an additional 10-fold coverage by single-end 454 sequencing resulted in ∼4 Gb of DNA sequence. Reads were assembled to a 40 Mb draft version (N50 of 117 kb) with the Velvet assembler. Comparative analysis with Neurospora genomes increased the N50 to 498 kb. The S. macrospora genome contains even fewer repeat regions than its closest sequenced relative, Neurospora crassa. Comparison with genomes of other fungi showed that S. macrospora, a model organism for morphogenesis and meiosis, harbors duplications of several genes involved in self/nonself-recognition. Furthermore, S. macrospora contains more polyketide biosynthesis genes than N. crassa. Phylogenetic analyses suggest that some of these genes may have been acquired by horizontal gene transfer from a distantly related ascomycete group. Our study shows that, for typical filamentous fungi, de novo assembly of genomes from short sequence reads alone is feasible, that a mixture of Solexa and 454 sequencing substantially improves the assembly, and that the resulting data can be used for comparative studies to address basic questions of fungal biology

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Edinburgh Research Explorer

eScholarship - University of California

The University of Manchester - Institutional Repository

Hochschulschriftenserver - Universität Frankfurt am Main