Search CORE

41 research outputs found

Management, Analyses, and Distribution of the MaizeCODE Data on the Cloud

Author: Birnbaum Kenneth
delaBastide Melissa
Dobin Alexander
Drenkow Jorg
Fernandez-Marco Cristina
Ghiban Cornel
Gingeras Thomas
Goodwin Sara
Jackson David
Lu Zhenyuan
Martienssen Robert
McCombie William
Micklos David
Ortiz-Ramirez Carlos
Regulski Michael
Schatz Michael
Van Buren Peter
Wang Liya
Wang Xiaofei
Ware Doreen
Xu Xiaosa
Publication venue: 'Cold Spring Harbor Laboratory'
Publication date: 23/11/2019
Field of study

Cold Spring Harbor Laboratory Institutional Repository

Enhanced Transcriptome Maps from Multiple Mouse Tissues Reveal Evolutionary Constraint in Gene Expression for Thousands of Genes

Author: Balasubramanian Suganthi
Beer Michael
Breschi Alessandra
Bussotti Giovanni
Davis Carrie
Djebali Sarah
Dobin Alex
Drenkow Jorg
Fastuca Meagan
Gerstein Mark
Gingeras Thomas
Guigo Roderic
Harmanci Arif
Lagarde Julien
Monlong Jean
Notredame Cedric
Pei Baikang
Pervouchine Dmitri
Prieto Barja Pablo
See Lei-Hoon
Tanzer Andrea
Wang Huaien
Zaleski Chris
Publication venue: 'Cold Spring Harbor Laboratory'
Publication date: 30/10/2014
Field of study

We characterized by RNA-seq the transcriptional profiles of a large and heterogeneous collection of mouse tissues, augmenting the mouse transcriptome with thousands of novel transcript candidates. Comparison with transcriptome profiles obtained in human cell lines reveals substantial conservation of transcriptional programs, and uncovers a distinct class of genes with levels of expression across cell types and species, that have been constrained early in vertebrate evolution. This core set of genes capture a substantial and constant fraction of the transcriptional output of mammalian cells, and participates in basic functional and structural housekeeping processes common to all cell types. Perturbation of these constrained genes is associated with significant phenotypes including embryonic lethality and cancer. Evolutionary constraint in gene expression levels is not reflected in the conservation of the genomic sequences, but is associated with strong and conserved epigenetic marking, as well as to a characteristic post-transcriptional regulatory program in which sub-cellular localization and alternative splicing play comparatively large roles

Cold Spring Harbor Laboratory Institutional Repository

Landscape of transcription in human cells

Eukaryotic cells make many types of primary and processed RNAs that are found either in specific sub-cellular compartments or throughout the cells. A complete catalogue of these RNAs is not yet available and their characteristic sub-cellular localizations are also poorly understood. Since RNA represents the direct output of the genetic information encoded by genomes and a significant proportion of a cell’s regulatory capabilities are focused on its synthesis, processing, transport, modifications and translation, the generation of such a catalogue is crucial for understanding genome function. Here we report evidence that three quarters of the human genome is capable of being transcribed, as well as observations about the range and levels of expression, localization, processing fates, regulatory regions and modifications of almost all currently annotated and thousands of previously unannotated RNAs. These observations taken together prompt to a redefinition of the concept of a gene

Carolina Digital Repository

Evidence for Transcript Networks Composed of Chimeric RNAs in Human Cells

Author: A Dobin
A Pombo
Adam Frankish
AJ Walhout
Alex Dobin
Alexandre Reymond
Alfonso Valencia
Bryan R. Lajoie
CA Maher
Catherine Ucla
Chenwei Lin
Christelle Borel
CJ McManus
Cédric Howald
D Gordon
DA Jackson
David Martin
E Birney
E Gilboa
EL Sonnhammer
Erica Dumais
F Denoeud
F Ozsolak
G Parra
H Kaessmann
H Li
HM Temin
Ian Bell
J Cocquet
J Dostie
J Harrow
J Houseley
Jacqueline Chrast
JE Collins
Jennifer Harrow
JL Thorvaldsen
Job Dekker
John Stamatoyannopoulos
Jonathan M. Mudge
Jorg Drenkow
Josep Lluís Gelpí
Julien Lagarde
K Kannan
K Salehi-Ashtiani
Kourosh Salehi-Ashtiani
LG Wilming
Lila Ghamsari
M Krzywinski
MA Quail
Marc Vidal
MI Krzywinski
Michael L. Tress
MJ Fullwood
Modesto Orozco
Nynke L. van Berkum
P Akiva
P Kapranov
P Unneberg
Paolo Ribeca
Philipp Kapranov
Philippe Batut
R Durbin
R Khanin
Roderic Guigó
RR Bowman
Ryan R. Murray
S Djebali
S Rozen
Sarah Djebali
SF Altschul
SM Searle
Stylianos E. Antonarakis
SW Roy
Sylvain Foissac
Thomas Preiss
Thomas R. Gingeras
Tim Hubbard
TR Gingeras
Vincent Lacroix
WJ Kent
X Li
X Wu
Xinping Yang
Y Qu
Publication venue: Public Library of Science
Publication date: 01/01/2012
Field of study

The classic organization of a gene structure has followed the Jacob and Monod bacterial gene model proposed more than 50 years ago. Since then, empirical determinations of the complexity of the transcriptomes found in yeast to human has blurred the definition and physical boundaries of genes. Using multiple analysis approaches we have characterized individual gene boundaries mapping on human chromosomes 21 and 22. Analyses of the locations of the 5′ and 3′ transcriptional termini of 492 protein coding genes revealed that for 85% of these genes the boundaries extend beyond the current annotated termini, most often connecting with exons of transcripts from other well annotated genes. The biological and evolutionary importance of these chimeric transcripts is underscored by (1) the non-random interconnections of genes involved, (2) the greater phylogenetic depth of the genes involved in many chimeric interactions, (3) the coordination of the expression of connected genes and (4) the close in vivo and three dimensional proximity of the genomic regions being transcribed and contributing to parts of the chimeric RNAs. The non-random nature of the connection of the genes involved suggest that chimeric transcripts should not be studied in isolation, but together, as an RNA network

Cold Spring Harbor Laboratory Institutional Repository

Directory of Open Access Journals

Serveur académique lausannois

HAL Descartes

eScholarship@UMMS

UPF Digital Repository

ProdInra

Hal-Diderot

FigShare

Public Library of Science (PLOS)

Crossref

Harvard University - DASH

INRIA a CCSD electronic archive server

PubMed Central

King's Research Portal

Diposit Digital de la Universitat de Barcelona

HAL-Rennes 1

Multi-tissue integrative analysis of personal epigenomes

Author: Adrian Jessika
Aganezov Sergey
Balderrama-Gutierrez Gabriela
Banskota Samridhi
Bernstein Bradley
Berthel Ana
Borsari Beatrice
Cameron Christopher
Chang Justin
Chee Sora
Chen Zhanlin
Cherry Michael
Chhetri Surya
Choudhary Jyoti
Corona Guillermo
Danyko Cassidy
Davis Carrie
Dobin Alexander
Drenkow Jorg
Epstein Charles
Farid Daniel
Farrell Nina
Gabdank Idan
Galeev Timur
Gao Jiahao
Gaskell Elizabeth
Gerstein Mark
Gillis Jesse
Gingeras Thomas
Gofin Yoel
Gorkin David
Gu Mengting
Guigo Roderic
Gursoy Gamze
Hecht Vivian
Hitz Benjamin
Issner Robbyn
Kirsche Melanie
Kong Xiangmeng
Lam Bonita
Levine Morgan
Li Bian
Li Shantao
Li Tianxiao
Li Xiqi
Lin Khine
Liu Jason
Luo Ruibang
Mackiewicz Mark
Martins Gabriel
Mendenhall Eric
Milosavljevic Aleksandar
Moore Jill
Mortazavi Ali
Mudge Jonathan
Myers Richard
Navarro Fabio
Nelson Nicholas
Noble William
Nusbaum Chad
Popov Ioann
Pratt Henry
Qiu Yunjiang
Ramakrishnan Srividya
Raymond Joe
Ren Bing
Rozowsky Joel
Salichos Leonidas
Scavelli Alexandra
Schatz Michael
Schreiber Jacob
Sedlazeck Fritz
See Lei
Sherman Rachel
Shi Minyi
Shi Xu
Shoresh Noam
Sloan Cricket
Snyder Michael
Strattan Seth
Sun Maxwell
Tan Zhen
Tanaka Forrest
Vlasova Anna
Wang Jun
Weng Zhiping
Werner Jonathan
Williams Brian
Wold Barbara
Wright James
Xiong Kun
Xu Jinrui
Xu Min
Yan Chengfei
Yang Yucheng
Yu Keyang
Yu Lu
Zaleski Christopher
Zhang Jing
Publication venue: 'Cold Spring Harbor Laboratory'
Publication date: 26/04/2021
Field of study

Evaluating the impact of genetic variants on transcriptional regulation is a central goal in biological science that has been constrained by reliance on a single reference genome. To address this, we constructed phased, diploid genomes for four cadaveric donors (using long-read sequencing) and systematically charted noncoding regulatory elements and transcriptional activity across more than 25 tissues from these donors. Integrative analysis revealed over a million variants with allele-specific activity, coordinated, locus-scale allelic imbalances, and structural variants impacting proximal chromatin structure. We relate the personal genome analysis to the ENCODE encyclopedia, annotating allele- and tissue-specific elements that are strongly enriched for variants impacting expression and disease phenotypes. These experimental and statistical approaches, and the corresponding EN-TEx resource, provide a framework for personalized functional genomics

Cold Spring Harbor Laboratory Institutional Repository

Caltech Authors

Comparative analysis of the transcriptome across distant species

Author: Adam Frankish
Alex Dobin
Alexandre Reymond
Ali Mortazavi
Anastasia Samsonova
Andrea Tanzer
Ann Hammonds
Anurag Sethi
Arif O. Harmanci
AT Kalinka
Baikang Pei
Benjamin W. Booth
BR Graveley
Brent Ewing
Brenton R. Graveley
Brian Oliver
Burak H. Alver
Carrie A. Davis
Chao Cheng
Chao Di
Chau Huynh
Chenghai Xue
Chris Zaleski
Cristina Sisu
Cédric Howald
D Brawand
Daifeng Wang
David M. Miller
DF Simola
Dionna Kasper
Dmitri Pervouchine
Elise A. Feingold
Eric Lai
Erik Ladewig
Felix Schlesinger
Frank J. Slack
Gang Fang
Garrett Robinson
Gary I. Saunders
Gemma May
Gennifer Merrihew
Guanjun Gao
Guilin Wang
Haiyan Huang
Henry Zheng
Huaien Wang
J Merkin
J Reichardt
James B. Brown
Jen Harrow
Jiayu Wen
Jing Leng
Jingyi Jessica Li
JJ Li
JM Stuart
Joel Rozowsky
Jorg Drenkow
Julien Lagarde
Kathie L. Watkins
Kejia Wen
Kenneth H. Wan
Kevin Yip
Kimberly Bell
KK Yan
Koon-Kiu Yan
LaDeana Hillier
Li Yang
Long Hu
Lucy Cherbas
M Levin
M Talerico
Marcus H. Stoiber
Mark B. Gerstein
Masaomi Kato
Max E. Boeck
MB Gerstein
Megan Fastuca
Michael J. Pazin
Michael MacCoss
Michael O. Duff
modENCODE Consortium
Nathan P. Boley
NL Barbosa-Morais
Norbert Perrimon
Owen A. Thompson
Peter Cherbas
Peter J. Bickel
Peter J. Good
Peter J. Park
Pnina Strasbourger
R Karlić
Rabi Murad
Raymond Auerbach
Rebecca McWhirter
Robert R. Kitchen
Robert Waterston
Roderic Guigó
Roger A. Hoskins
Roger P. Alexander
S Djebali
S Kirkpatrick
Sara Olson
Sarah Djebali
Sonali Jha
Steven E. Brenner
Susan E. Celniker
T Domazet-Lošo
Thomas C. Kaufman
Thomas R. Gingeras
Tim J. P. Hubbard
Valerie Reinke
William C. Spencer
Yan Zhang
Zhi Lu
ZJ Lu
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2014
Field of study

The transcriptome is the readout of the genome. Identifying common features in it across distant species can reveal fundamental principles. To this end, the ENCODE and modENCODE consortia have generated large amounts of matched RNA-sequencing data for human, worm and fly. Uniform processing and comprehensive annotation of these data allow comparison across metazoan phyla, extending beyond earlier within-phylum transcriptome comparisons and revealing ancient, conserved features. Specifically, we discover co-expression modules shared across animals, many of which are enriched in developmental genes. Moreover, we use expression patterns to align the stages in worm and fly development and find a novel pairing between worm embryo and fly pupae, in addition to the embryo-to-embryo and larvae-to-larvae pairings. Furthermore, we find that the extent of non-canonical, non-coding transcription is similar in each organism, per base pair. Finally, we find in all three organisms that the gene-expression levels, both coding and non-coding, can be quantitatively predicted from chromatin features at the promoter using a 'universal model' based on a single set of organism-independent parameters

Crossref

Cold Spring Harbor Laboratory Institutional Repository

University of Birmingham Research Portal

Harvard University - DASH

Serveur académique lausannois

PubMed Central

eScholarship - University of California

UPF Digital Repository

King's Research Portal

Brunel University Research Archive

Detection of Deleted Genomic DNA Using a Semiautomated Computational Analysis of GeneChip Data

Author: Drenkow Jorg
Gingeras Thomas R.
Kato-Maeda Midori
Salamon Hugh
Small Peter M.
Publication venue: Cold Spring Harbor Laboratory Press
Publication date: 01/01/2000
Field of study

Genomic diversity within and between populations is caused by single nucleotide mutations, changes in repetitive DNA systems, recombination mechanisms, and insertion and deletion events. The contribution of these sources to diversity, whether purely genetic or of phenotypic consequence, can only be investigated if we have the means to quantitate and characterize diversity in many samples. With the advent of complete sequence characterization of representative genomes of different species, the possibility of developing protocols to screen for genetic polymorphism across entire genomes is actively being pursued. The large numbers of measurements such approaches yield demand that we pay careful attention to the numerical analysis of data. In this paper we present a novel application of an Affymetrix GeneChip to perform genome-wide screens for deletion polymorphism. A high-density oligonucleotide array formatted for mRNA expression and targeted at a fully sequenced 4.4-million–base pair Mycobacterium tuberculosis standard strain genome was adapted to compare genomic DNA. Hybridization intensities to 111,000 probe pairs (perfect complement and mismatch complement) were measured for genomic DNA from a clinical strain and from a vaccine organism. Because individual probe-pair hybridization intensities exhibit limited sensitivity/specificity characteristics to detect deletions, data-analytical methodology to exploit measurements from multiple probes in tandem locations across the genome was developed. The TSTEP (Tandem Set Terminal Extreme Probability) algorithm designed specifically to analyze the tandem hybridization measurements data was applied and shown to discover genomic deletions with high sensitivity. The TSTEP algorithm provides a foundation for similar efforts to characterize deletions in many hybridization measures in similar-sized and larger genomes. Issues relating to the design of genome content screening experiments and the implications of these methods for studying population genomics and the evolution of genomes are discussed

Cold Spring Harbor Laboratory Institutional Repository

PubMed Central

Examples of the complex architecture of the human transcriptome revealed by RACE and high-density tiling arrays

Author: Cheng Jill
Dike Sujit
Drenkow Jorg
Gingeras Thomas R.
Helt Gregg
Kapranov Philipp
Long Jeffrey
Publication venue: Cold Spring Harbor Laboratory Press
Publication date: 01/01/2005
Field of study

Recently, we mapped the sites of transcription across ∼30% of the human genome and elucidated the structures of several hundred novel transcripts. In this report, we describe a novel combination of techniques including the rapid amplification of cDNA ends (RACE) and tiling array technologies that was used to further characterize transcripts in the human transcriptome. This technical approach allows for several important pieces of information to be gathered about each array-detected transcribed region, including strand of origin, start and termination positions, and the exonic structures of spliced and unspliced coding and noncoding RNAs. In this report, the structures of transcripts from 14 transcribed loci, representing both known genes and unannotated transcripts taken from the several hundred randomly selected unannotated transcripts described in our previous work are represented as examples of the complex organization of the human transcriptome. As a consequence of this complexity, it is not unusual that a single base pair can be part of an intricate network of multiple isoforms of overlapping sense and antisense transcripts, the majority of which are unannotated. Some of these transcripts follow the canonical splicing rules, whereas others combine the exons of different genes or represent other types of noncanonical transcripts. These results have important implications concerning the correlation of genotypes to phenotypes, the regulation of complex interlaced transcriptional patterns, and the definition of a gene

Crossref

Cold Spring Harbor Laboratory Institutional Repository

PubMed Central

Comparing Genomes within the Species Mycobacterium tuberculosis

Author: Drenkow Jorg
Gingeras Thomas R.
Kato-Maeda Midori
Rhee Jeanne T.
Salamon Hugh
Small Peter M.
Smittipat Nat
Publication venue: Cold Spring Harbor Laboratory Press
Publication date: 01/01/2001
Field of study

The study of genetic variability within natural populations of pathogens may provide insight into their evolution and pathogenesis. We used a Mycobacterium tuberculosis high-density oligonucleotide microarray to detect small-scale genomic deletions among 19 clinically and epidemiologically well-characterized isolates of M. tuberculosis. The pattern of deletions detected was identical within mycobacterial clones but differed between different clones, suggesting that this is a suitable genotyping system for epidemiologic studies. An analysis of genomic deletions among an extant population of pathogenic bacteria provided a novel perspective on genomic organization and evolution. Deletions are likely to contain ancestral genes whose functions are no longer essential for the organism's survival, whereas genes that are never deleted constitute the minimal mycobacterial genome. As the amount of genomic deletion increased, the likelihood that the bacteria will cause pulmonary cavitation decreased, suggesting that the accumulation of mutations tends to diminish their pathogenicity. Array-based comparative genomics is a promising approach to exploring molecular epidemiology, microbial evolution, and pathogenesis

Cold Spring Harbor Laboratory Institutional Repository

PubMed Central