Search CORE

123 research outputs found

Finite covers of random 3-manifolds

Author: A. Lubotzky
A. Żuk
A.L. Edmonds
A.Y. Ol’shanskiĭ
B. Everitt
B. Weisfeiler
B. Zimmermann
C. Itzykson
C. Livingston
D. Jungreis
D. Zagier
D.M. Jackson
F. Waldhausen
H.M. Hilden
J. Harer
J. Hempel
J. Hempel
J. MacWilliams
L. Carlitz
M. Belolipetsky
M. Gromov
M.E. White
M.J. Evans
M.W. Liebeck
M.W. Liebeck
N.M. Dunfield
Nathan M. Dunfield
P. Diaconis
P. Hall
R. Gilman
R.C. Penner
R.P. Stanley
W. Ledermann
William P. Thurston
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 06/11/2007
Field of study

A 3-manifold is Haken if it contains a topologically essential surface. The Virtual Haken Conjecture posits that every irreducible 3-manifold with infinite fundamental group has a finite cover which is Haken. In this paper, we study random 3-manifolds and their finite covers in an attempt to shed light on this difficult question. In particular, we consider random Heegaard splittings by gluing two handlebodies by the result of a random walk in the mapping class group of a surface. For this model of random 3-manifold, we are able to compute the probabilities that the resulting manifolds have finite covers of particular kinds. Our results contrast with the analogous probabilities for groups coming from random balanced presentations, giving quantitative theorems to the effect that 3-manifold groups have many more finite quotients than random groups. The next natural question is whether these covers have positive betti number. For abelian covers of a fixed type over 3-manifolds of Heegaard genus 2, we show that the probability of positive betti number is 0. In fact, many of these questions boil down to questions about the mapping class group. We are lead to consider the action of mapping class group of a surface S on the set of quotients pi_1(S) -> Q. If Q is a simple group, we show that if the genus of S is large, then this action is very mixing. In particular, the action factors through the alternating group of each orbit. This is analogous to Goldman's theorem that the action of the mapping class group on the SU(2) character variety is ergodic.Comment: 60 pages; v2: minor changes. v3: minor changes; final versio

arXiv.org e-Print Archive

Crossref

Evidence for a novel overlapping coding sequence in POLG initiated at a CUG start codon

Author: Choudhary Jyoti S.
Firth Andrew E.
Jungreis Irwin
Kellis Manolis
Khan Yousuf A.
Mudge Jonathan M.
Wright James C.
Publication venue: BMC Genetics
Publication date: 06/03/2020
Field of study

Abstract: Background: POLG, located on nuclear chromosome 15, encodes the DNA polymerase γ(Pol γ). Pol γ is responsible for the replication and repair of mitochondrial DNA (mtDNA). Pol γ is the only DNA polymerase found in mitochondria for most animal cells. Mutations in POLG are the most common single-gene cause of diseases of mitochondria and have been mapped over the coding region of the POLG ORF. Results: Using PhyloCSF to survey alternative reading frames, we found a conserved coding signature in an alternative frame in exons 2 and 3 of POLG, herein referred to as ORF-Y that arose de novo in placental mammals. Using the synplot2 program, synonymous site conservation was found among mammals in the region of the POLG ORF that is overlapped by ORF-Y. Ribosome profiling data revealed that ORF-Y is translated and that initiation likely occurs at a CUG codon. Inspection of an alignment of mammalian sequences containing ORF-Y revealed that the CUG codon has a strong initiation context and that a well-conserved predicted RNA stem-loop begins 14 nucleotides downstream. Such features are associated with enhanced initiation at near-cognate non-AUG codons. Reanalysis of the Kim et al. (2014) draft human proteome dataset yielded two unique peptides that map unambiguously to ORF-Y. An additional conserved uORF, herein referred to as ORF-Z, was also found in exon 2 of POLG. Lastly, we surveyed Clinvar variants that are synonymous with respect to the POLG ORF and found that most of these variants cause amino acid changes in ORF-Y or ORF-Z. Conclusions: We provide evidence for a novel coding sequence, ORF-Y, that overlaps the POLG ORF. Ribosome profiling and mass spectrometry data show that ORF-Y is expressed. PhyloCSF and synplot2 analysis show that ORF-Y is subject to strong purifying selection. An abundance of disease-correlated mutations that map to exons 2 and 3 of POLG but also affect ORF-Y provides potential clinical significance to this finding

DSpace@MIT

Apollo (Cambridge)

Institute of Cancer Research Repository

Discovery of Human sORF-Encoded Polypeptides (SEPs) in Cell Lines and Tissue

Author: Budnik Bogdan A.
Jungreis Irwin
Kellis Manolis
Ma Jiao
Neveu John
Saghatelian Alan
Schwaid Adam G.
Slavoff Sarah A.
Ward Carl C.
Publication venue: 'American Chemical Society (ACS)'
Publication date: 14/02/2014
Field of study

The existence of nonannotated protein-coding human short open reading frames (sORFs) has been revealed through the direct detection of their sORF-encoded polypeptide (SEP) products. The discovery of novel SEPs increases the size of the genome and the proteome and provides insights into the molecular biology of mammalian cells, such as the prevalent usage of non-AUG start codons. Through modifications of the existing SEP-discovery workflow, we discover an additional 195 SEPs in K562 cells and extend this methodology to identify novel human SEPs in additional cell lines and human tissue for a final tally of 237 new SEPs. These results continue to expand the human genome and proteome and demonstrate that SEPs are a ubiquitous class of nonannotated polypeptides that require further investigation

DSpace@MIT

Crossref

Harvard University - DASH

PubMed Central

FigShare

Recommended from our members

A high-resolution map of human evolutionary constraint using 29 mammals.

Author: Alföldi Jessica
Baldwin Jen
Baylor College of Medicine Human Genome Sequencing Center Sequencing Team
Beal Kathryn
Birney Ewan
Bloom Toby
Broad Institute Sequencing Platform and Whole Genome Assembly Team
Chang Jean
Chin Chee Whye
Clamp Michele
Clawson Hiram
Cree Andrew
Cuff James
Delehaunty Kim
Di Palma Federica
Dihn Huyen H
Dooling David
Ernst Jason
Fitzgerald Stephen
Flicek Paul
Fowler Gerald
Fronik Catrina
Fulton Bob
Fulton Lucinda
Garber Manuel
Genome Institute at Washington University
Gibbs Richard A
Gnerre Sante
Goldman Nick
Graves Tina
Green Eric D
Guttman Mitchell
Haussler David
Heiman Dave
Herrero Javier
Holloway Alisha K
Hubisz Melissa J
Jaffe David B
Jhangiani Shalili
Jordan Gregory
Joshi Vandita
Jungreis Irwin
Kellis Manolis
Kent W James
Kheradpour Pouya
Kostka Dennis
Kovar Christie L
Lander Eric S
Lara Marcia
Lee Sandra
Lewis Lora R
Lin Michael F
Lindblad-Toh Kerstin
Lowe Craig B
Mardis Elaine R
Margulies Elliott H
Martins Andre L
Massingham Tim
Mauceli Evan
Minx Patrick
Moltke Ida
Muzny Donna M
Nazareth Lynne V
Nicol Robert
Nusbaum Chad
Okwuonu Geoffrey
Parker Brian J
Pedersen Jakob S
Pollard Katherine S
Raney Brian J
Rasmussen Matthew D
Robinson Jim
Santibanez Jireh
Siepel Adam
Sodergren Erica
Stark Alexander
Vilella Albert J
Ward Lucas D
Warren Wesley C
Washietl Stefan
Weinstock George M
Wen Jiayu
Wilkinson Jane
Wilson Richard K
Worley Kim C
Xie Xiaohui
Young Sarah
Zody Michael C
Zuk Or
Publication venue: eScholarship, University of California
Publication date: 01/10/2011
Field of study

The comparison of related genomes has emerged as a powerful lens for genome interpretation. Here we report the sequencing and comparative analysis of 29 eutherian genomes. We confirm that at least 5.5% of the human genome has undergone purifying selection, and locate constrained elements covering ∼4.2% of the genome. We use evolutionary signatures and comparisons with experimental data sets to suggest candidate functions for ∼60% of constrained bases. These elements reveal a small number of new coding exons, candidate stop codon readthrough events and over 10,000 regions of overlapping synonymous constraint within protein-coding exons. We find 220 candidate RNA structural families, and nearly a million elements overlapping potential promoter, enhancer and insulator regions. We report specific amino acid residues that have undergone positive selection, 280,000 non-coding elements exapted from mobile elements and more than 1,000 primate- and human-accelerated elements. Overlap with disease-associated variants indicates that our findings will be relevant for studies of human biology, health and disease

eScholarship - University of California

Discovery of high-confidence human protein-coding genes and exons by whole-genome PhyloCSF helps elucidate 118 GWAS loci.

Author: Bruford E.
Choudhary J.S.
Davidson C.
Fitzgerald S.
Frankish A.
Gonzalez J.M.
He L.
Hunt T.
Jungreis I.
Kay M.
Kellis M.
Li Y.
Mudge J.M.
Seal R.
Tweedie S.
Waterhouse R.M.
Wright J.C.
Publication venue: 'Cold Spring Harbor Laboratory'
Publication date: 01/12/2019
Field of study

The most widely appreciated role of DNA is to encode protein, yet the exact portion of the human genome that is translated remains to be ascertained. We previously developed PhyloCSF, a widely used tool to identify evolutionary signatures of protein-coding regions using multispecies genome alignments. Here, we present the first whole-genome PhyloCSF prediction tracks for human, mouse, chicken, fly, worm, and mosquito. We develop a workflow that uses machine learning to predict novel conserved protein-coding regions and efficiently guide their manual curation. We analyze more than 1000 high-scoring human PhyloCSF regions and confidently add 144 conserved protein-coding genes to the GENCODE gene set, as well as additional coding regions within 236 previously annotated protein-coding genes, and 169 pseudogenes, most of them disabled after primates diverged. The majority of these represent new discoveries, including 70 previously undetected protein-coding genes. The novel coding genes are additionally supported by single-nucleotide variant evidence indicative of continued purifying selection in the human lineage, coding-exon splicing evidence from new GENCODE transcripts using next-generation transcriptomic data sets, and mass spectrometry evidence of translation for several new genes. Our discoveries required simultaneous comparative annotation of other vertebrate genomes, which we show is essential to remove spurious ORFs and to distinguish coding from pseudogene regions. Our new coding regions help elucidate disease-associated regions by revealing that 118 GWAS variants previously thought to be noncoding are in fact protein altering. Altogether, our PhyloCSF data sets and algorithms will help researchers seeking to interpret these genomes, while our new annotations present exciting loci for further experimental characterization

Serveur académique lausannois

Institute of Cancer Research Repository

Genomic RNA Elements Drive Phase Separation of the SARS-CoV-2 Nucleocapsid

Author: Baric R.S.
Boerneke M.A.
Ekena J.
Fritch E.J.
Gladfelter A.S.
Hou Y.J.
Iserman C.
Jungreis I.
Kellis M.
McLaughlin G.A.
Roden C.A.
Sealfon R.S.G.
Sheahan T.P.
Theesfeld C.L.
Troyanskaya O.G.
Weeks K.M.
Weidmann C.A.
Publication venue: Cell Press
Publication date: 01/01/2020
Field of study

We report that the SARS-CoV-2 nucleocapsid protein (N-protein) undergoes liquid-liquid phase separation (LLPS) with viral RNA. N-protein condenses with specific RNA genomic elements under physiological buffer conditions and condensation is enhanced at human body temperatures (33°C and 37°C) and reduced at room temperature (22°C). RNA sequence and structure in specific genomic regions regulate N-protein condensation while other genomic regions promote condensate dissolution, potentially preventing aggregation of the large genome. At low concentrations, N-protein preferentially crosslinks to specific regions characterized by single-stranded RNA flanked by structured elements and these features specify the location, number, and strength of N-protein binding sites (valency). Liquid-like N-protein condensates form in mammalian cells in a concentration-dependent manner and can be altered by small molecules. Condensation of N-protein is RNA sequence and structure specific, sensitive to human body temperature, and manipulatable with small molecules, and therefore presents a screenable process for identifying antiviral compounds effective against SARS-CoV-2

Carolina Digital Repository

Evolution of enhanced innate immune evasion by SARS-CoV-2

Author: Batra J
Beltrao P
Bischof ML
Bonfanti P
Bouhaddou M
Braberg H
Chen KH
Fabius JM
Fossati A
García-Sastre A
Goodfellow IG
Harjai B
Hiatt J
Hosmillo M
Jahun A
Jolly C
Jungreis I
Jura N
Kellis M
Krogan NJ
McGovern BL
Memon D
Noursadeghi M
Obernier K
Pelin A
Polacco B
Ragazzini R
Reuschl AK
Richards A
Rojc A
Rosales R
Shokat K
Soucheray M
Swaney DL
Takeuchi Y
Thorne LG
Towers GJ
Turner J
Ummadi M
Verba K
Whelan MVX
White K
Zuliani-Alvarez L
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 17/02/2022
Field of study

Emergence of SARS-CoV-2 variants of concern (VOCs) suggests viral adaptation to enhance human-to-human transmission1,2. Although much effort has focused on characterisation of spike changes in VOCs, mutations outside spike likely contribute to adaptation. Here we used unbiased abundance proteomics, phosphoproteomics, RNAseq and viral replication assays to show that isolates of the Alpha (B.1.1.7) variant3 more effectively suppress innate immune responses in airway epithelial cells, compared to first wave isolates. We found that Alpha has dramatically increased subgenomic RNA and protein levels of N, Orf9b and Orf6, all known innate immune antagonists. Expression of Orf9b alone suppressed the innate immune response through interaction with TOM70, a mitochondrial protein required for RNA sensing adaptor MAVS activation. Moreover, the activity of Orf9b and its association with TOM70 was regulated by phosphorylation. We propose that more effective innate immune suppression, through enhanced expression of specific viral antagonist proteins, increases the likelihood of successful Alpha transmission, and may increase in vivo replication and duration of infection4. The importance of mutations outside Spike in adaptation of SARS-CoV-2 to humans is underscored by the observation that similar mutations exist in the Delta and Omicron N/Orf9b regulatory regions

UCL Discovery

GENCODE: reference annotation for the human and mouse genomes in 2023.

Author: Arnan Carme
Banerjee Abhimanyu
Barnes If
Bennett Ruth
Berry Andrew
Bignell Alexandra
Boix Carles
Calvet Ferriol
Carbonell-Sala Sílvia
Cerdán-Vélez Daniel
Choudhary Jyoti S
Cunningham Fiona
Davidson Claire
Diekhans Mark
Donaldson Sarah
Dursun Cagatay
Fatima Reham
Flicek Paul
Frankish Adam
Gerstein Mark
Giorgetti Stefano
Giron Carlos Garcıa
Gonzalez Jose Manuel
Guigo Roderic
Gómez Laura Martínez
Hardy Matthew
Harrison Peter W
Hollis Zoe
Hourlier Thibaut
Hubbard Tim J P
Hunt Toby
James Benjamin
Jiang Yunzhe
Johnson Rory
Jungreis Irwin
Kay Mike
Kellis Manolis
Kundaje Anshul
Lagarde Julien
Loveland Jane E
Martin Fergal J
Mudge Jonathan M
Nair Surag
Ni Pengyu
Paten Benedict
Pozo Fernando
Ramalingam Vivek
Ruffier Magali
Schmitt Bianca M
Schreiber Jacob M
Sisu Cristina
Steed Emily
Sumathipala Dulika
Suner Marie-Marthe
Sycheva Irina
Tress Michael L
Uszczynska-Ratajczak Barbara
Wass Elizabeth
Wright James C
Yang Yucheng T
Yates Andrew
Zafrulla Zahoor
Publication venue: 'Oxford University Press (OUP)'
Publication date: 24/11/2022
Field of study

GENCODE produces high quality gene and transcript annotation for the human and mouse genomes. All GENCODE annotation is supported by experimental data and serves as a reference for genome biology and clinical genomics. The GENCODE consortium generates targeted experimental data, develops bioinformatic tools and carries out analyses that, along with externally produced data and methods, support the identification and annotation of transcript structures and the determination of their function. Here, we present an update on the annotation of human and mouse genes, including developments in the tools, data, analyses and major collaborations which underpin this progress. For example, we report the creation of a set of non-canonical ORFs identified in GENCODE transcripts, the LRGASP collaboration to assess the use of long transcriptomic data to build transcript models, the progress in collaborations with RefSeq and UniProt to increase convergence in the annotation of human and mouse protein-coding genes, the propagation of GENCODE across the human pan-genome and the development of new tools to support annotation of regulatory features by GENCODE. Our annotation is accessible via Ensembl, the UCSC Genome Browser and https://www.gencodegenes.org

Bern Open Repository and Information System (BORIS)

A High-Resolution Map of Human Evolutionary Constraint Using 29 Mammals

Author: A Keinan
A Siepel
A Siepel
A Stark
Adam Siepel
Albert J. Vilella
Alexander Stark
Alisha K. Holloway
Andre L. Martins
Brian J. Parker
Brian J. Raney
CB Lowe
Christie L. Kovar
Craig B. Lowe
D Altshuler
D Baek
D Pillas
D Schmidt
David B. Jaffe
David Haussler
Dennis Kostka
Donna M. Muzny
Elaine R. Mardis
Elliott H. Margulies
Eric D. Green
Eric S. Lander
ES Lander
ET Wang
EV Davydov
Evan Mauceli
Ewan Birney
F Chiaromonte
Federica Di Palma
G Bejerano
Genome 10K Community Of Scientists
George M. Weinstock
GM Cooper
Gregory Jordan
Hiram Clawson
Ida Moltke
Irwin Jungreis
J Ernst
J Ernst
J Harrow
JA Drake
Jakob S. Pedersen
James Cuff
Jason Ernst
Javier Herrero
Jean Chang
Jessica Alföldi
Jiayu Wen
Jim Robinson
JS Pedersen
JT Lee
JW Thomas
K Lindblad-Toh
Katherine S. Pollard
Kathryn Beal
KD Pruitt
Kerstin Lindblad-Toh
Kim C. Worley
KS Pollard
Lucas D. Ward
M Clamp
M Garber
M Guttman
M Kellis
Manolis Kellis
Manuel Garber
Marcia Lara
Maria L. Martínez-Chantar
Matthew D. Rasmussen
Melissa J. Hubisz
MF Lin
MF Lin
Michael C. Zody
Michael F. Lin
Michele Clamp
Mitchell Guttman
MJ Hubisz
Nick Goldman
Or Zuk
P Kheradpour
Paul Flicek
Pouya Kheradpour
RA Gibbs
RH Waterston
Richard A. Gibbs
Richard K. Wilson
S Gnerre
S Maenner
S Meader
S Prabhakar
S Tumpel
S Washietl
Sante Gnerre
Stefan Washietl
Stephen Fitzgerald
Tim Massingham
TS Mikkelsen
W. James Kent
Wesley C. Warren
X Lampe
X Xie
Xiaohui Xie
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2011
Field of study

The comparison of related genomes has emerged as a powerful lens for genome interpretation. Here we report the sequencing and comparative analysis of 29 eutherian genomes. We confirm that at least 5.5% of the human genome has undergone purifying selection, and locate constrained elements covering ~4.2% of the genome. We use evolutionary signatures and comparisons with experimental data sets to suggest candidate functions for ~60% of constrained bases. These elements reveal a small number of new coding exons, candidate stop codon readthrough events and over 10,000 regions of overlapping synonymous constraint within protein-coding exons. We find 220 candidate RNA structural families, and nearly a million elements overlapping potential promoter, enhancer and insulator regions. We report specific amino acid residues that have undergone positive selection, 280,000 non-coding elements exapted from mobile elements and more than 1,000 primate- and human-accelerated elements. Overlap with disease-associated variants indicates that our findings will be relevant for studies of human biology, health and disease.National Human Genome Research Institute (U.S.)National Institute of General Medical Sciences (U.S.) (Grant number GM82901)National Science Foundation (U.S.). Postdoctural Fellowship (Award 0905968)National Science Foundation (U.S.). Career (0644282)National Institutes of Health (U.S.) (R01-HG004037)Alfred P. Sloan Foundation.Austrian Science Fund. Erwin Schrodinger Fellowshi

DSpace@MIT

Crossref

Cold Spring Harbor Laboratory Institutional Repository

Copenhagen University Research Information System

PubMed Central

eScholarship - University of California

Ebola virus epidemiology, transmission, and evolution during seven months in Sierra Leone

Author: Amman Brian
Andersen Kristian G.
Basile Jane
Bearden Scott
Bedford Trevor
Belser Jessica
Bergeron Eric
Bird Brian
Birren Bruce W.
Blau Dianna
Bochicchio James
Brault Aaron
Campbell Shelley
Chakrabarti Ayan
Chapman Sinead
Dodd Kimberly
Dudas Gytis
Erickson Bobbie R.
Flint Mike
Foday Momoh
Forget Marc
Garry Robert F.
Gbakie Michael
Gevao Sahr M.
Gibbons Aridth
Gire Stephen K.
Gladden-Young Adrianne
Gnirke Andreas
Goba Augustine
Goodman Christin
Happi Christian
Hensley Lisa E.
Holmes Edward C.
Jalloh Abdul A.
Jalloh Simbirie
Jiang Pan-Pan
Jungreis Irwin
Kamara Fatima K.
Kanneh Lansana
Kargbo Brima
Kargbo David
Kislyuk Andrey
Klena John
Konuwa Edwin
Kugelman Jeffrey R.
Kuhn Jens H.
Ladner Jason T.
Lin Aaron E.
Lin Michael F.
MacInnis Bronwyn
Mamoh Mambu
Massally James L.B.
Matranga Christian B.
Matthews Ashley
McMullan Laura
Morgan Laura
Moses Lina M.
Mustapha Ibrahim
Ndiaye Daouda
Nichol Stuart T.
Nosamiefan Dolo
Nusbaum Chad
Paddock Christopher
Palacios Gustavo F.
Park Daniel J.
Qu James
Rambaut Andrew
Russell Brandy
Sabeti Pardis C.
Salzer Johanna
Sanchez Angela
Schaffner Stephen F.
Schieffelin John S.
Sealfon Rachel S.
Sealy Tara
Sellu Josephine
Ströher Ute
Tomkins-Tinch Christopher
Towner Jonathan
Vandi Mohamed A.
Wang David
Whitmer Shannon L.M.
Winnicki Sarah M.
Wohl Shirlee
Yillah Mohamed
Yozwiak Nathan L.
Publication venue: 'Elsevier BV'
Publication date: 18/06/2015
Field of study

The 2013-2015 Ebola virus disease (EVD) epidemic is caused by the Makona variant of Ebola virus (EBOV). Early in the epidemic, genome sequencing provided insights into virus evolution and transmission and offered important information for outbreak response. Here, we analyze sequences from 232 patients sampled over 7 months in Sierra Leone, along with 86 previously released genomes from earlier in the epidemic. We confirm sustained human-to-human transmission within Sierra Leone and find no evidence for import or export of EBOV across national borders after its initial introduction. Using high-depth replicate sequencing, we observe both host-to-host transmission and recurrent emergence of intrahost genetic variants. We trace the increasing impact of purifying selection in suppressing the accumulation of nonsynonymous mutations over time. Finally, we note changes in the mucin-like domain of EBOV glycoprotein that merit further investigation. These findings clarify the movement of EBOV within the region and describe viral evolution during prolonged human-to-human transmission

Elsevier - Publisher Connector

PubMed Central

Edinburgh Research Explorer

MSF Field Research