69 research outputs found

Recommended from our members

Towards a Library of Standard Operating Procedures (SOPs) for (meta)genomic annotation

Author: Angiuoli Samuel V.
Cochrane Guy
Field Dawn
Garrity George
Gussman Aaron
Klimke William
Kodira Chinnappa D.
Kyrpides Nikos
Madupu Ramana
Markowitz Victor
Tatusova Tatiana
Thomson Nick
White Owen
Publication venue: Lawrence Berkeley National Laboratory
Publication date: 01/04/2008

Genome annotations describe the features of genomes and accompany sequences in genome databases. The methodologies used to generate genome annotation are diverse and typically vary amongst groups. Descriptions of the annotation procedure are helpful in interpreting genome annotation data. Standard Operating Procedures (SOPs) for genome annotation describe the processes that generate genome annotations. Some groups are currently documenting procedures, but standards are lacking for the structure and content of annotation SOPs. In addition, there is no central repository to store and disseminate procedures and protocols for genome annotation. We highlight the importance of SOPs for genome annotation and endorse a central online repository of SOPs.

UNT Digital Library

The Cassava Genome: Current Progress, Future Directions

Author: AA Raji
AC Roa
AP Chan
B Boher
BL Patil
Brian Desany
Chinnappa Kodira
Claude Fauquet
D Edwards
Daniel S. Rokhsar
F Awoleye
Fausto Rodriguez
GA Tuskan
H Ceballos
HM Lam
J Schmutz
Joseph Tohme
K Reilly
M Balat
Mohammed Mohiuddin
N Gill
NL Quinn
NM Springer
Pablo D. Rabinowicz
PM Schmitz
Pradeep Reddy Marri
RJ Elshire
RJ Hillocks
S Sraphet
S Tangphatsornruang
Simon Prochnik
Steve Rounsley
Timothy Harkins
VV Kapitonov
Publication venue: Springer-Verlag
Publication date: 01/01/2012

The starchy swollen roots of cassava provide an essential food source for nearly a billion people, as well as possibilities for bioenergy, yet improvements to nutritional content and resistance to threatening diseases are currently impeded. A 454-based whole genome shotgun sequence has been assembled, which covers 69% of the predicted genome size and 96% of protein-coding gene space, with genome finishing underway. The predicted 30,666 genes and 3,485 alternate splice forms are supported by 1.4 M expressed sequence tags (ESTs). Maps based on simple sequence repeat (SSR)- and EST-derived single nucleotide polymorphisms (SNPs) already exist. Thanks to the genome sequence, a high-density linkage map is currently being developed from a cross between two diverse cassava cultivars: one susceptible to cassava brown streak disease; the other resistant. An efficient genotyping-by-sequencing (GBS) approach is being developed to catalog SNPs both within the mapping population and among diverse African farmer-preferred varieties of cassava. These resources will accelerate marker-assisted breeding programs, allowing improvements in disease resistance and nutrition, and will help us understand the genetic basis for disease resistance.

Crossref

Springer - Publisher Connector

PubMed Central

CGSpace

Genomic Analysis of the Basal Lineage Fungus Rhizopus oryzae Reveals a Whole-Genome Duplication

Author: Abe Ayumi
Birren Bruce W.
Burger Gertraud
Butler Margi
Calvo Sarah E.
Corrochano Luis M.
Cuomo Christina A.
Elias Marek
Engels Reinhard
Fu Jianmin
Galagan James
Grabherr Manfred G.
Hansberg Wilhelm
Ibrahim Ashraf S.
Idnurm Alexander
Kim Jung-Mi
Kodira Chinnappa D.
Koehrsen Michael J.
Lang B. Franz
Liu Bo
Ma Li-Jun
Miranda-Saavedra Diego
O'Leary Sinead
Ortiz-Castellanos Lucila
Poulter Russell
Rodriguez-Romero Julio
Ruiz-Herrera José
Shen Yao-Qing
Skory Christopher
Sone Teruo
Wickes Brian L.
Zeng Qiandong
Publication venue: Public Library of Science
Publication date: 01/01/2009

Rhizopus oryzae is the primary cause of mucormycosis, an emerging, life-threatening infection characterized by rapid angioinvasive growth with an overall mortality rate that exceeds 50%. As a representative of the paraphyletic basal group of the fungal kingdom called “zygomycetes,” R. oryzae is also used as a model to study fungal evolution. Here we report the genome sequence of R. oryzae strain 99–880, isolated from a fatal case of mucormycosis. The highly repetitive 45.3 Mb genome assembly contains abundant transposable elements (TEs), comprising approximately 20% of the genome. We predicted 13,895 protein-coding genes not overlapping TEs, many of which are paralogous gene pairs. The order and genomic arrangement of the duplicated gene pairs and their common phylogenetic origin provide evidence for an ancestral whole-genome duplication (WGD) event. The WGD resulted in the duplication of nearly all subunits of the protein complexes associated with respiratory electron transport chains, the V-ATPase, and the ubiquitin–proteasome systems. The WGD, together with recent gene duplications, resulted in the expansion of multiple gene families related to cell growth and signal transduction, as well as secreted aspartic protease and subtilase protein families, which are known fungal virulence factors. The duplication of the ergosterol biosynthetic pathway, especially the major azole target, lanosterol 14α-demethylase (ERG11), could contribute to the variable responses of R. oryzae to different azole drugs, including voriconazole and posaconazole. Expanded families of cell-wall synthesis enzymes, essential for fungal cell integrity but absent in mammalian hosts, reveal potential targets for novel and R. oryzae-specific diagnostic and therapeutic treatments.

CiteSeerX

ScholarWorks@UMass Amherst

Harvard University - DASH

Directory of Open Access Journals

PubMed Central

University of Melbourne Institutional Repository

idUS. Depósito de Investigación Universidad de Sevilla

Comparative Genomics of a Plant-Pathogenic Fungus, Pyrenophora tritici-repentis, Reveals Transduplication and the Impact of Repeat Elements on Pathogenicity and Population Divergence

Author: Berlin Aaron M.
Ciuffetti Lynda M.
Dhillon Braham
Figueroa Melania
Freitag Michael
Goodwin Stephen B.
Grigoriev Igor V.
Hane James K.
Henrissat Bernard
Holman Wade H.
Kodira Chinnappa D.
Ma Li-Jun
Manning Viola A.
Martin Joel
Oliver Richard P.
Pandelova Iovanna
Robbertse Barbara
Schackwitz Wendy
Schwartz David C.
Spatafora Joseph W.
Turgeon B. Gillian
Wilhelm Larry J.
Yandava Chandri
Young Sarah
Zeng Qiandong
Zhou Shiguo
Publication venue: Genetics Society of America

Pyrenophora tritici-repentis is a necrotrophic fungus that causes tan spot of wheat, a disease whose contribution to crop loss has increased significantly during the last few decades. Pathogenicity by this fungus is attributed to the production of host-selective toxins (HSTs), which are recognized by their host in a genotype-specific manner. To better understand the mechanisms that have led to the increase in disease incidence related to this pathogen, we sequenced the genomes of three P. tritici-repentis isolates. A pathogenic isolate that produces two known HSTs was used to assemble a reference nuclear genome of approximately 40 Mb composed of 11 chromosomes that encode 12,141 predicted genes. Comparison of the reference genome with those of a pathogenic isolate that produces a third HST, and a nonpathogenic isolate, showed the nonpathogen genome to be more diverged than those of the two pathogens. Examination of gene-coding regions has provided candidate pathogen-specific proteins and revealed gene families that may play a role in a necrotrophic lifestyle. Analysis of transposable elements suggests that their presence in the genome of pathogenic isolates contributes to the creation of novel genes, effector diversification, possible horizontal gene transfer events, identified copy number variation, and the first example of transduplication by DNA transposable elements in fungi. Overall, comparative analysis of these genomes provides evidence that pathogenicity in this species arose through an influx of transposable elements, which created a genetically flexible landscape that can easily respond to environmental changes.

This is the publisher’s final pdf. The published article is copyrighted by The Genetics Society of America and can be found at: http://www.genetics-gsa.org/

Keywords: ToxB, Anastomosis, Wheat (Triticum aestivum), Copy number variation, ToxA, Histone H3 transduplication

ScholarsArchive@OSU

Analysis of the Genome and Transcriptome of Cryptococcus neoformans var. grubii Reveals Complex RNA Expression and Microevolution Leading to Virulence Attenuation

Cryptococcus neoformans is a pathogenic basidiomycetous yeast responsible for more than 600,000 deaths each year. It occurs as two serotypes (A and D) representing two varieties (i.e. grubii and neoformans, respectively). Here, we sequenced the genome and performed an RNA-Seq-based analysis of the C. neoformans var. grubii transcriptome structure. We determined the chromosomal locations, analyzed the sequence/structural features of the centromeres, and identified origins of replication. The genome was annotated based on automated and manual curation. More than 40,000 introns populating more than 99% of the expressed genes were identified. Although most of these introns are located in the coding DNA sequences (CDS), over 2,000 introns in the untranslated regions (UTRs) were also identified. Poly(A)-containing reads were employed to locate the polyadenylation sites of more than 80% of the genes. Examination of the sequences around these sites revealed a new poly(A)-site-associated motif (AUGHAH). In addition, 1,197 miscRNAs were identified. These miscRNAs can be spliced and/or polyadenylated, but do not appear to have obvious coding capacities. Finally, this genome sequence enabled a comparative analysis of strain H99 variants obtained after laboratory passage. The spectrum of mutations identified provides insights into the genetics underlying the micro-evolution of a laboratory strain, and identifies mutations involved in stress responses, mating efficiency, and virulence.

Carolina Digital Repository

Genome-wide characterization of simple sequence repeats in cucumber (Cucumis sativus L.)

Author: A Dijkhuizen
A Saveliev
AG Jones
C Jeffrey
C Schlötterer
Chinnappa D Kodira
CR Primmer
D Tautz
DE MacHugh
DJ Somers
Douglas A Senalik
EA Sia
FC Serquen
G Bates
G Toth
GA Tuskan
H Ellegren
H Innan
H Wang
HJ Price
IRGSP (International Rice Genome Sequencing Project)
J Quackenbush
JA Eisen
JC Garza
JE Bowers
JH Mun
JH Peng
JL Weber
JM Bradeen
K Arumuganathan
L Cardle
L Santi
LD Knerr
Luming Yang
M Morgante
M Wierdl
MD Robbins
MG Murray
N Huo
N Watcharawongpaiboon
NN Fitzsimmons
O Jaillon
P Sebastian
Pablo F Cavagnaro
Philipp W Simon
Q Kong
R Gur-Arie
R Lister
R Wooster
RK Varshney
S Huang
S Leclercq
S Rozen
S Subramanian
S Temnykh
Sanwen Huang
SG Guo
SP Zhang
SR Kruglyak
SR McCouch
SS Renner
T Horejsi
T Thiel
Timothy T Harkins
TW Whitaker
V Meglic
W Powell
WC Kennard
Y Danin-Poleg
Y Kashi
Y Ren
Y Weng
YC Li
YH Han
YH Park
Yiqun Weng
Z Li
Publication venue: BioMed Central
Publication date: 01/01/2010

Abstract

Background: Cucumber, Cucumis sativus L., is an important vegetable crop worldwide. Until very recently, cucumber genetic and genomic resources, especially molecular markers, have been very limited, impeding progress of cucumber breeding efforts. Microsatellites are short tandemly repeated DNA sequences, which are frequently favored as genetic markers due to their high level of polymorphism and codominant inheritance. Data from previously characterized genomes have shown that these repeats vary in frequency, motif sequence, and genomic location across taxa. During the last year, the genomes of two cucumber genotypes were sequenced, including the Chinese fresh market type inbred line '9930' and the North American pickling type inbred line 'Gy14'. These sequences provide a powerful tool for developing markers on a large scale. In this study, we surveyed and characterized the distribution and frequency of perfect microsatellites in 203 Mbp assembled Gy14 DNA sequences, representing 55% of its nuclear genome, and in cucumber EST sequences. Similar analyses were performed in genomic and EST data from seven other plant species, and the results were compared with those of cucumber.

Results: A total of 112,073 perfect repeats were detected in the Gy14 cucumber genome sequence, accounting for 0.9% of the assembled Gy14 genome, with an overall density of 551.9 SSRs/Mbp. While tetranucleotides were the most frequent microsatellites in genomic DNA sequence, dinucleotide repeats, which had more repeat units than any other SSR type, had the highest cumulative sequence length. Coding regions (ESTs) of the cucumber genome had fewer microsatellites compared to its genomic sequence, with trinucleotides predominating in EST sequences. AAG was the most frequent repeat in cucumber ESTs. Overall, AT-rich motifs prevailed in both genomic and EST data. Compared to the other species examined, cucumber genomic sequence had the highest density of SSRs (although comparable to the density of poplar, grapevine and rice), and was richest in AT dinucleotides. Using an electronic PCR strategy, we investigated the polymorphism between 9930 and Gy14 at 1,006 SSR loci, and found an unexpectedly high degree of polymorphism (48.3%) between the two genotypes. The level of polymorphism seems to be positively associated with the number of repeat units in the microsatellite. The in silico PCR results were validated empirically in 660 of the 1,006 SSR loci. In addition, primer sequences for more than 83,000 newly discovered cucumber microsatellites, and their exact positions in the Gy14 genome assembly, were made publicly available.

Conclusions: The cucumber genome is rich in microsatellites; AT and AAG are the most abundant repeat motifs in genomic and EST sequences of cucumber, respectively. Considering all the species investigated, some commonalities were noted, especially within the monocot and dicot groups, although the distribution of motifs and the frequency of certain repeats were characteristic of the species examined. The large number of SSR markers developed from this study should be a significant contribution to the cucurbit research community.

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Comparative and Functional Genomics of Rhodococcus opacus PD630 for Biofuels Development

Author: A Arakaki
A Argyrou
A Marchler-Bauer
A Pohlmann
A Stamatakis
AF Alvarez
AI Saeed
AJ Enright
AK Pandey
AL Delcher
Alex L. B. Leach
AM Waterhouse
Anthony C. DeBono
Anthony J. Sinskey
AR Horswill
Brian Desany
Bruce W. Birren
C Kaddor
Chinnappa D. Kodira
Christine Dancel
Christopher A. Desjardins
D Jendrossek
D Portevin
D Post
DE Vance
Dirk Gevers
DL Rainwater
DP MacEachran
E Puglisi
E Schwartz
E Schweizer
E Severi
E Vimr
ER Goncalves
F Abascal
F David
F-F Hsu
G Timmins
HM Alvarez
I Letunic
I Matsunaga
IB Lomakin
IC Sutcliffe
Ion Ghiviriga
J Hughes
J Rengarajan
Jason P. Affourtit
Jason W. Holder
Jeremy Zucker
Jil C. Ulrich
JM Mathieu
K Isono
K Katoh
K Kurosawa
K Lagesen
K Raman
KC Yam
KR Robrock
L Diacovich
L Li
M Brudno
M Green
M Hernandez
M Seto
M Wu
MA Larkin
MJ de Hoon
MP Mansour
MP McLeod
O Lenz
O Zimhony
OP Peoples
PA Lessard
Paul A. Godfrey
Paul M. Richardson
PD Karp
PR Romero
Qiandong Zeng
R Edgar
R Gande
R Kalscheuer
R Van der Geize
RD Finn
RL Hunter
S Griffiths-Jones
S Guindon
S Kikuchi
S Rajakumari
SC Slater
SK Parker
T Chopra
T Lee
T Sirakova
TD Sirakova
Thomas Abeel
TM Lowe
U Grafe
X Yang
Y Hu
Publication venue: Public Library of Science (PLoS)
Publication date: 01/01/2011

The Actinomycetales bacteria Rhodococcus opacus PD630 and Rhodococcus jostii RHA1 bioconvert a diverse range of organic substrates through lipid biosynthesis into large quantities of energy-rich triacylglycerols (TAGs). To describe the genetic basis of the Rhodococcus oleaginous metabolism, we sequenced and performed comparative analysis of the 9.27 Mb R. opacus PD630 genome. Metabolic reconstruction assigned 2017 enzymatic reactions to the 8632 R. opacus PD630 genes we identified. Of these, 261 genes were implicated in the R. opacus PD630 TAGs cycle by metabolic reconstruction and gene family analysis. Rhodococcus synthesizes uncommon straight-chain odd-carbon fatty acids in high abundance and stores them as TAGs. We have identified these to be pentadecanoic, heptadecanoic, and cis-heptadecenoic acids. To identify bioconversion pathways, we screened R. opacus PD630, R. jostii RHA1, Ralstonia eutropha H16, and C. glutamicum 13032 for growth on 190 compounds. The results of the catabolic screen, phylogenetic analysis of the TAGs cycle enzymes, and metabolic product characterizations were integrated into a working model of prokaryotic oleaginy.

Cambridge-MIT Institute; Massachusetts Institute of Technology (Seed Grant program); Shell Oil Company; National Institute of Allergy and Infectious Diseases (U.S.); United States. National Institutes of Health; National Institutes of Health. Department of Health and Human Services (Contract No. HHSN272200900006C)

CiteSeerX

Public Library of Science (PLOS)

DSpace@MIT

Crossref

Directory of Open Access Journals

PubMed Central

Genomic Analysis of the Hydrocarbon-Producing, Cellulolytic, Endophytic Fungus Ascocoryne sarcoides

Author: A Andreou
A Mortazavi
A Schirmer
A. Michael Sismour
AN Grechkin
Andrea Sboner
AS Sarpal
B Langmead
BL Cantarel
Brian F. Dunican
Cambria J. Alpha
CG Kumar
Chinnappa Kodira
CR Fischer
D Croes
D Fitzpatrick
D Martinez
DA Benson
Daniel J. Spakowicz
DJ Sukovich
DR Dodds
E Combet
EG James
EM Marcotte
EV Koonin
F Brodhun
F Ozsolak
G Del Sorbo
G Parra
G Stephanopoulos
G Strobel
G Yadav
GA Strobel
George M. Church
H Redestig
HR Beller
I Alam
J Berdy
J Gao
J Piel
JL Fortman
JO Korbel
L Habegger
M Ashburner
M Askenazi
M Kanehisa
M Margulies
M Pellegrini
M Saloheimo
M Wink
M Wurzenberger
MA Griffin
MA Rude
Mark B. Gerstein
MB Arnaud
MB Gerstein
Meghan A. Griffin
Michael Egholm
MY Hirai
N Saito
PH Bradley
R Caspi
R Overbeek
R Tressl
RD Finn
S Bouhired
S Murahashi
SA Rahman
Scott A. Strobel
SF Altschul
SK Lee
SL Bumgarner
SL Camera
SR Eddy
Sébastien Monchy
T Hancock
Tara A. Gianoulis
U Nagalakshmi
W Wieloch
WJ Kent
X-ai Chen
Y Moriya
Publication venue: Public Library of Science
Publication date: 01/01/2012

The microbial conversion of solid cellulosic biomass to liquid biofuels may provide a renewable energy source for transportation fuels. Endophytes represent a promising group of organisms, as they are a mostly untapped reservoir of metabolic diversity. They are often able to degrade cellulose, and they can produce an extraordinary diversity of metabolites. The filamentous fungal endophyte Ascocoryne sarcoides was shown to produce potential biofuel metabolites when grown on a cellulose-based medium; however, the genetic pathways needed for this production are unknown and the lack of genetic tools makes traditional reverse genetics difficult. We present the genomic characterization of A. sarcoides and use transcriptomic and metabolomic data to describe the genes involved in cellulose degradation and to provide hypotheses for the biofuel production pathways. In total, almost 80 biosynthetic clusters were identified, including several previously found only in plants. Additionally, many transcriptionally active regions outside of genes showed condition-specific expression, offering more evidence for the role of long non-coding RNA in gene regulation. This is one of the highest quality fungal genomes and, to our knowledge, the only thoroughly annotated and transcriptionally profiled fungal endophyte genome currently available. The analyses and datasets contribute to the study of cellulose degradation and biofuel production and provide the genomic foundation for the study of a model endophyte system.

CiteSeerX

Crossref

Harvard University - DASH

Directory of Open Access Journals

PubMed Central

FigShare

Sequencing of Culex quinquefasciatus establishes a platform for mosquito comparative genomics

Author: Abrudan Jenica
Amedeo Paolo
Antelo Beatriz
Arensburger Peter
Atkinson Peter W.
Bartholomay Lyric
Bidwell Shelby
Birren Bruce
Caler Elisabet
Camara Francisco
Campbell Corey L.
Campbell Kathryn S.
Casola Claudio
Castro Marta T.
Chandramouliswaran Ishwar
Chapman Sinéad B.
Christensen Bruce M.
Christley Scott
Collins Frank H.
Cornel Anthony
Costas Javier
Dimopoulos George
Eisenstadt Eric
Feschotte Cedric
Fraser-Liggett Claire
Guigó Serra Roderic
Haas Brian
Hammond Martin
Hannick Linda I.
Hansson Bill S.
Hemingway Janet
Higgs Stephen
Hill Sharon
Howarth Clint
Ignell Rickard
Kennedy Ryan C.
Kodira Chinnappa D.
Lanzaro Gregory C.
Lawson Daniel
Lee Norman H.
Liu Nannan
Lobo Neil F.
Mao Chunhong
Mayhew George
Megy Karine
Michel Kristin
Mori Akio
Muskavitch Marc A. T.
Naveira Horacio
Nene Vishvanath
Nguyen Nam
Pearson Matthew D.
Pritham Ellen J.
Puiu Daniela
Qi Yumin
Raikhel Alexander S.
Ranson Hilary
Ribeiro Jose M. C.
Robertson Hugh M.
Severson David W.
Shumway Martin
Stanke Mario
Strausberg Robert
Sun Cheng
Sutton Granger
Tu Zhijian (Jake)
Tubio Jose M. C.
Unger Maria F
Vanlandingham Dana L.
Vilella Albert J.
Waterhouse Robert M.
White Jared R.
White Owen
Wondji Charles S.
Wortman Jennifer
Zdobnov Evgeny M.
Publication venue: American Association for the Advancement of Science (AAAS)
Publication date: 24/07/2018

Culex quinquefasciatus (the southern house mosquito) is an important mosquito vector of viruses such as West Nile virus and St. Louis encephalitis virus, as well as of nematodes that cause lymphatic filariasis. C. quinquefasciatus is one species within the Culex pipiens species complex and can be found throughout tropical and temperate climates of the world. The ability of C. quinquefasciatus to take blood meals from birds, livestock, and humans contributes to its ability to vector pathogens between species. Here, we describe the genomic sequence of C. quinquefasciatus: its repertoire of 18,883 protein-coding genes is 22% larger than that of Aedes aegypti and 52% larger than that of Anopheles gambiae, with multiple gene-family expansions, including olfactory and gustatory receptors, salivary gland genes, and genes associated with xenobiotic detoxification.

Diposit Digital de la Universitat de Barcelona