Search CORE

66 research outputs found

Computational methods for transcriptome annotation and quantification using RNA-seq

Author: Garber Manuel
Grabherr Manfred G.
Guttman Mitchell
Trapnell Cole
Publication venue: Nature Publishing Group
Publication date: 01/06/2011
Field of study

High-throughput RNA sequencing (RNA-seq) promises a comprehensive picture of the transcriptome, allowing for the complete annotation and quantification of all genes and their isoforms across samples. Realizing this promise requires increasingly complex computational methods. These computational challenges fall into three main categories: (i) read mapping, (ii) transcriptome reconstruction and (iii) expression quantification. Here we explain the major conceptual and practical challenges, and the general classes of solutions for each category. Finally, we highlight the interdependence between these categories and discuss the benefits for different biological applications

Metatranscriptomics captures dynamic shifts in mycorrhizal coordination in boreal forests

Author: Castro David
Daguerre Yohann
Grabherr Manfred G.
Hurry Vaughan
Law Simon R.
Näsholm Torgny
Schneider Andreas N.
Serrano Alonso
Stangl Zsofia Reka
Stangl Zsofia Réka
Street Nathaniel R.
Sundh John
Publication venue
Publication date: 01/01/2022
Field of study

Carbon storage and cycling in boreal forests—the largest terrestrial carbon store—ismoderated by complex interactions between trees and soil microorganisms. However,existing methods limit our ability to predict how changes in environmental conditionswill alter these associations and the essential ecosystem services they provide. To addressthis, we developed a metatranscriptomic approach to analyze the impact of nutrientenrichment on Norway sprucefine roots and the community structure, function, andtree–microbe coordination of over 350 root-associated fungal species. In response toaltered nutrient status, host trees redefined their relationship with the fungal commu-nity by reducing sugar efflux carriers and enhancing defense processes. This resulted ina profound restructuring of the fungal community and a collapse in functional coordi-nation between the tree and the dominant Basidiomycete species, and an increase infunctional coordination with versatile Ascomycete species. As such, there was a func-tional shift in community dominance from Basidiomycetes species, with importantroles in enzymatically cycling recalcitrant carbon, to Ascomycete species that have mela-nized cell walls that are highly resistant to degradation. These changes were accompa-nied by prominent shifts in transcriptional coordination between over 60 predictedfungal effectors, with more than 5,000 Norway spruce transcripts, providing mechanis-tic insight into the complex molecular dialogue coordinating host trees and their fungalpartners. The host–microbe dynamics captured by this study functionally inform howthese complex and sensitive biological relationships may mediate the carbon storagepotential of boreal soils under changing nutrient conditions

Epsilon Open Archive

Publikationer från Uppsala Universitet

PubMed Central

Digitala Vetenskapliga Arkivet - Academic Archive On-line

Exploiting Nucleotide Composition to Engineer Promoters

Author: A Bird
A Hochheimer
A Munteanu
AB Georges
AI Su
Arkady B. Khodursky
DN Cooper
DR Liston
Evan Mauceli
F Hsu
Federica Di Palma
FM Wurm
G Badis
G Bernardi
J Butler
J Haseloff
Jens Pontiller
JM Rozenberg
Kerstin Lindblad-Toh
M Gardiner-Garden
Manfred G. Grabherr
Martina Baumann
MF Berger
MF Berger
MG Reese
Michael C. Zody
MM Babu
NG de Bruijn
P Carninci
P Hossler
P Stegmaier
Pamela Russell
RAA Veitia
Reingard M. Grabherr
Ross Swofford
S Smale
SA Benner
T Juven-Gershon
T Kim
T Omasa
Tara Biagi
W Deng
Wolfgang Ernst
Publication venue: Public Library of Science
Publication date: 01/01/2010
Field of study

The choice of promoter is a critical step in optimizing the efficiency and stability of recombinant protein production in mammalian cell lines. Artificial promoters that provide stable expression across cell lines and can be designed to the desired strength constitute an alternative to the use of viral promoters. Here, we show how the nucleotide characteristics of highly active human promoters can be modelled via the genome-wide frequency distribution of short motifs: by overlapping motifs that occur infrequently in the genome, we constructed contiguous sequence that is rich in GC and CpGs, both features of known promoters, but lacking homology to real promoters. We show that snippets from this sequence, at 100 base pairs or longer, drive gene expression in vitro in a number of mammalian cells, and are thus candidates for use in protein production. We further show that expression is driven by the general transcription factors TFIIB and TFIID, both being ubiquitously present across cell types, which results in less tissue- and species-specific regulation compared to the viral promoter SV40. We lastly found that the strength of a promoter can be tuned up and down by modulating the counts of GC and CpGs in localized regions. These results constitute a “proof-of-concept” for custom-designing promoters that are suitable for biotechnological and medical applications

CiteSeerX

Public Library of Science (PLOS)

Crossref

PubMed Central

An Improved Canine Genome and a Comprehensive Catalogue of Coding Genes and Non-Coding Transcripts

Author: Abouelleil Amr
Aftuck Lynne
Alföldi Jessica
Berlin Aaron
Bessette Daniel
Brown Adam
Cook April
di Palma Federica
FitzGerald Michael G.
Gearin Gary
Grabherr Manfred G.
Greka Anna
Hoeppner Marc P.
Johnson Jeremy
Lander Eric S.
Lindblad-Toh Kerstin
Lui Annie
Lundquist Andrew
Macdonald J. Pendexter
Mauceli Evan
Meadows Jennifer R. S.
Moghadam Behrooz Torabi
Pirun Mono
Priest Margaret
Shea Terrance
Sundström Görel
Swofford Ross
Turner-Maier Jason
Zamani Neda
Zimmer Andrew
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/11/2013
Field of study

The domestic dog, Canis familiaris, is a well-established model system for mapping trait and disease loci. While the original draft sequence was of good quality, gaps were abundant particularly in promoter regions of the genome, negatively impacting the annotation and study of candidate genes. Here, we present an improved genome build, canFam3.1, which includes 85 MB of novel sequence and now covers 99.8% of the euchromatic portion of the genome. We also present multiple RNA-Sequencing data sets from 10 different canine tissues to catalog ∼175,000 expressed loci. While about 90% of the coding genes previously annotated by EnsEMBL have measurable expression in at least one sample, the number of transcript isoforms detected by our data expands the EnsEMBL annotations by a factor of four. Syntenic comparison with the human genome revealed an additional ∼3,000 loci that are characterized as protein coding in human and were also expressed in the dog, suggesting that those were previously not annotated in the EnsEMBL canine gene set. In addition to ∼20,700 high-confidence protein coding loci, we found ∼4,600 antisense transcripts overlapping exons of protein coding genes, ∼7,200 intergenic multi-exon transcripts without coding potential, likely candidates for long intergenic non-coding RNAs (lincRNAs) and ∼11,000 transcripts were reported by two different library construction methods but did not fit any of the above categories. Of the lincRNAs, about 6,000 have no annotated orthologs in human or mouse. Functional analysis of two novel transcripts with shRNA in a mouse kidney cell line altered cell morphology and motility. All in all, we provide a much-improved annotation of the canine genome and suggest regulatory functions for several of the novel non-coding transcripts

DSpace@MIT

Crossref

Harvard University - DASH

Publikationer från Uppsala Universitet

Directory of Open Access Journals

PubMed Central

Digitala Vetenskapliga Arkivet - Academic Archive On-line

FigShare

Genomic Analysis of the Basal Lineage Fungus Rhizopus oryzae Reveals a Whole-Genome Duplication

Author: Abe Ayumi
Birren Bruce W.
Burger Gertraud
Butler Margi
Calvo Sarah E.
Corrochano Luis M.
Cuomo Christina A.
Elias Marek
Engels Reinhard
Fu Jianmin
Galagan James
Grabherr Manfred G.
Hansberg Wilhelm
Ibrahim Ashraf S.
Idnurm Alexander
Kim Jung-Mi
Kodira Chinnappa D.
Koehrsen Michael J.
Lang B. Franz
Liu Bo
Ma Li-Jun
Miranda-Saavedra Diego
O'Leary Sinead
Ortiz-Castellanos Lucila
Poulter Russell
Rodriguez-Romero Julio
Ruiz-Herrera José
Shen Yao-Qing
Skory Christopher
Sone Teruo
Wickes Brian L.
Zeng Qiandong
Publication venue: Public Library of Science
Publication date: 01/01/2009
Field of study

Rhizopus oryzae is the primary cause of mucormycosis, an emerging, life-threatening infection characterized by rapid angioinvasive growth with an overall mortality rate that exceeds 50%. As a representative of the paraphyletic basal group of the fungal kingdom called “zygomycetes,” R. oryzae is also used as a model to study fungal evolution. Here we report the genome sequence of R. oryzae strain 99–880, isolated from a fatal case of mucormycosis. The highly repetitive 45.3 Mb genome assembly contains abundant transposable elements (TEs), comprising approximately 20% of the genome. We predicted 13,895 protein-coding genes not overlapping TEs, many of which are paralogous gene pairs. The order and genomic arrangement of the duplicated gene pairs and their common phylogenetic origin provide evidence for an ancestral whole-genome duplication (WGD) event. The WGD resulted in the duplication of nearly all subunits of the protein complexes associated with respiratory electron transport chains, the V-ATPase, and the ubiquitin–proteasome systems. The WGD, together with recent gene duplications, resulted in the expansion of multiple gene families related to cell growth and signal transduction, as well as secreted aspartic protease and subtilase protein families, which are known fungal virulence factors. The duplication of the ergosterol biosynthetic pathway, especially the major azole target, lanosterol 14α-demethylase (ERG11), could contribute to the variable responses of R. oryzae to different azole drugs, including voriconazole and posaconazole. Expanded families of cell-wall synthesis enzymes, essential for fungal cell integrity but absent in mammalian hosts, reveal potential targets for novel and R. oryzae-specific diagnostic and therapeutic treatments

CiteSeerX

ScholarWorks@UMass Amherst

Harvard University - DASH

Directory of Open Access Journals

PubMed Central

University of Melbourne Institutional Repository

idUS. Depósito de Investigación Universidad de Sevilla

microTaboo: a general and practical solution to the k-disjoint problem

Author: A Rath
AR Pavankumar
DE Knuth
Eric Sandström
FR Blattner
G Marcais
GO Sperber
I Grissa
JK Brown
M Mielczarek
Manfred Grabherr
ML Leggett
Mohammed Al-Jaff
Mouse Genome Sequencing Consortium et al
N Zamani
P Stothard
R Wu
RM Karp
RS Boyer
S Kurtz
SW Cho
T Hayashi
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

The genomic basis of adaptive evolution in threespine sticklebacks

Author: A Allali-Hassani
A Miyake
A Miyake
A Siepel
AA Hoffmann
AE Vinogradov
AK Knecht
Alex A. Pollen
Anne K. Knecht
AYK Albert
B Star
BE Deagle
Brian R. Summers
CB Kimmel
Chris Amemiya
CL Peichel
Craig T. Miller
CT Miller
D Hagen
David M. Kingsley
DB Lowry
DJ Kvitek
DL Stern
DL Stern
DM Kingsley
DM Kingsley
Eric S. Lander
Evan Mauceli
Ewan Birney
F Jones
Federica Di Palma
Felicity C. Jones
G Wray
GA Gutman
Haili Zhang
HE Hoekstra
J Kitano
J Yu
JA Endler
Jane Grimwood
JE Barrick
Jeremy Johnson
Jeremy Schmutz
JL Feder
JS McKinnon
Kerstin Lindblad-Toh
M Joron
M Kasahara
M Kirkpatrick
MA Bell
Manfred G. Grabherr
Mark C. Dickson
MD Shapiro
Michael C. Zody
Mono Pirun
NH Barton
O Jaillon
P Wittkopp
PA Hohenlohe
Pamela Russell
PF Colosimo
PF Colosimo
R Kawahara
RDH Barrett
Richard M. Myers
Ross Swofford
S Aparicio
SB Carroll
Shannon D. Brady
Simon White
Stephen Searle
TE Reimchen
Timothy Howes
WA Cresko
WS Marshall
YF Chan
Yingguang Frank Chan
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/09/2011
Field of study

Marine stickleback fish have colonized and adapted to thousands of streams and lakes formed since the last ice age, providing an exceptional opportunity to characterize genomic mechanisms underlying repeated ecological adaptation in nature. Here we develop a high-quality reference genome assembly for threespine sticklebacks. By sequencing the genomes of twenty additional individuals from a global set of marine and freshwater populations, we identify a genome-wide set of loci that are consistently associated with marine–freshwater divergence. Our results indicate that reuse of globally shared standing genetic variation, including chromosomal inversions, has an important role in repeated evolution of distinct marine and freshwater sticklebacks, and in the maintenance of divergent ecotypes during early stages of reproductive isolation. Both coding and regulatory changes occur in the set of loci underlying marine–freshwater evolution, but regulatory changes appear to predominate in this well known example of repeated adaptive evolution in nature.National Human Genome Research Institute (U.S.)National Human Genome Research Institute (U.S.) (NHGRI CEGS Grant P50-HG002568

DSpace@MIT

Crossref

PubMed Central

eScholarship - University of California

MPG.PuRe