90 research outputs found

motifDiverge: a model for assessing the statistical significance of gene regulatory motif divergence between two DNA sequences

Author: Friedrich Tara
Holloway Alisha K.
Kostka Dennis
Pollard Katherine S.
Publication date: 31/01/2014

Next-generation sequencing technology enables the identification of thousands of gene regulatory sequences in many cell types and organisms. We consider the problem of testing if two such sequences differ in their number of binding site motifs for a given transcription factor (TF) protein. Binding site motifs impart regulatory function by providing TFs the opportunity to bind to genomic elements and thereby affect the expression of nearby genes. Evolutionary changes to such functional DNA are hypothesized to be major contributors to phenotypic diversity within and between species; but despite the importance of TF motifs for gene expression, no method exists to test for motif loss or gain. Assuming that motif counts are Binomially distributed, and allowing for dependencies between motif instances in evolutionarily related sequences, we derive the probability mass function of the difference in motif counts between two nucleotide sequences. We provide a method to numerically estimate this distribution from genomic data and show through simulations that our estimator is accurate. Finally, we introduce the R package motifDiverge that implements our methodology and illustrate its application to gene regulatory enhancers identified by a mouse developmental time course experiment. While this study was motivated by analysis of regulatory motifs, our results can be applied to any problem involving two correlated Bernoulli trials.
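
As a rough illustration of the kind of calculation this abstract describes, the Python sketch below computes the probability mass function of the difference between two motif counts under a deliberately simplified model. It assumes that both sequences offer the same number n of candidate positions, that positions are paired one-to-one, and that the pairs are independent, each pair being two correlated Bernoulli trials. These assumptions, the function name `diff_pmf`, and its parameters are hypothetical and chosen for illustration; they are not taken from the paper or from the motifDiverge package, whose model may be more general.

```python
from math import comb


def diff_pmf(n, p_x, p_y, p_both):
    """PMF of D = (motif count in X) - (motif count in Y).

    Simplifying assumptions (illustrative, not the package's model):
      * n independent pairs of positions, one position per sequence;
      * p_x, p_y are the marginal motif probabilities at a position;
      * p_both is the probability that both positions in a pair carry
        the motif, which encodes the correlation between sequences.
    Returns a dict mapping each difference d in [-n, n] to P(D = d).
    """
    p_plus = p_x - p_both              # motif in X only: contributes +1
    p_minus = p_y - p_both             # motif in Y only: contributes -1
    p_neither = 1.0 - p_x - p_y + p_both
    if min(p_plus, p_minus, p_neither, p_both) < 0:
        raise ValueError("inconsistent joint probabilities")
    p_zero = p_neither + p_both        # both or neither: contributes 0

    pmf = {}
    for d in range(-n, n + 1):
        total = 0.0
        # k pairs contribute +1 and j = k - d pairs contribute -1.
        for k in range(max(d, 0), n + 1):
            j = k - d
            if k + j > n:
                break
            ways = comb(n, k) * comb(n - k, j)   # trinomial coefficient
            total += ways * p_plus**k * p_minus**j * p_zero**(n - k - j)
        pmf[d] = total
    return pmf


if __name__ == "__main__":
    pmf = diff_pmf(10, 0.3, 0.2, 0.1)
    # One-sided tail probability of seeing a difference of at least 4.
    print(sum(p for d, p in pmf.items() if d >= 4))
```

Under these assumptions each pair contributes +1, -1, or 0 to the difference, so the distribution reduces to a sum over trinomial terms. A tail probability such as the one printed in the usage example could serve as a p-value for motif gain, but only to the extent that the simplified model is appropriate for the data.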

arXiv.org e-Print Archive

Crossref

PubMed Central

eScholarship - University of California

A high-resolution map of human evolutionary constraint using 29 mammals.

Author: Alföldi Jessica
Baldwin Jen
Baylor College of Medicine Human Genome Sequencing Center Sequencing Team
Beal Kathryn
Birney Ewan
Bloom Toby
Broad Institute Sequencing Platform and Whole Genome Assembly Team
Chang Jean
Chin Chee Whye
Clamp Michele
Clawson Hiram
Cree Andrew
Cuff James
Delehaunty Kim
Di Palma Federica
Dihn Huyen H
Dooling David
Ernst Jason
Fitzgerald Stephen
Flicek Paul
Fowler Gerald
Fronik Catrina
Fulton Bob
Fulton Lucinda
Garber Manuel
Genome Institute at Washington University
Gibbs Richard A
Gnerre Sante
Goldman Nick
Graves Tina
Green Eric D
Guttman Mitchell
Haussler David
Heiman Dave
Herrero Javier
Holloway Alisha K
Hubisz Melissa J
Jaffe David B
Jhangiani Shalili
Jordan Gregory
Joshi Vandita
Jungreis Irwin
Kellis Manolis
Kent W James
Kheradpour Pouya
Kostka Dennis
Kovar Christie L
Lander Eric S
Lara Marcia
Lee Sandra
Lewis Lora R
Lin Michael F
Lindblad-Toh Kerstin
Lowe Craig B
Mardis Elaine R
Margulies Elliott H
Martins Andre L
Massingham Tim
Mauceli Evan
Minx Patrick
Moltke Ida
Muzny Donna M
Nazareth Lynne V
Nicol Robert
Nusbaum Chad
Okwuonu Geoffrey
Parker Brian J
Pedersen Jakob S
Pollard Katherine S
Raney Brian J
Rasmussen Matthew D
Robinson Jim
Santibanez Jireh
Siepel Adam
Sodergren Erica
Stark Alexander
Vilella Albert J
Ward Lucas D
Warren Wesley C
Washietl Stefan
Weinstock George M
Wen Jiayu
Wilkinson Jane
Wilson Richard K
Worley Kim C
Xie Xiaohui
Young Sarah
Zody Michael C
Zuk Or
Publication venue: eScholarship, University of California
Publication date: 01/10/2011

The comparison of related genomes has emerged as a powerful lens for genome interpretation. Here we report the sequencing and comparative analysis of 29 eutherian genomes. We confirm that at least 5.5% of the human genome has undergone purifying selection, and locate constrained elements covering ∼4.2% of the genome. We use evolutionary signatures and comparisons with experimental data sets to suggest candidate functions for ∼60% of constrained bases. These elements reveal a small number of new coding exons, candidate stop codon readthrough events and over 10,000 regions of overlapping synonymous constraint within protein-coding exons. We find 220 candidate RNA structural families, and nearly a million elements overlapping potential promoter, enhancer and insulator regions. We report specific amino acid residues that have undergone positive selection, 280,000 non-coding elements exapted from mobile elements and more than 1,000 primate- and human-accelerated elements. Overlap with disease-associated variants indicates that our findings will be relevant for studies of human biology, health and disease.

eScholarship - University of California

Genomic analysis of the relationship between gene expression variation and DNA polymorphism in Drosophila simulans

Author: Begun David J
Holloway Alisha K
Jones Corbin D
Lawniczak Mara KN
Publication venue: eScholarship, University of California
Publication date: 01/01/2008

Background: Understanding how DNA sequence polymorphism relates to variation in gene expression is essential to connecting genotypic differences with phenotypic differences among individuals. Addressing this question requires linking population genomic data with gene expression variation. Results: Using whole genome expression data and recent light shotgun genome sequencing of six Drosophila simulans genotypes, we assessed the relationship between expression variation in males and females and nucleotide polymorphism across thousands of loci. By examining sequence polymorphism in gene features, such as untranslated regions and introns, we find that genes showing greater variation in gene expression between genotypes also have higher levels of sequence polymorphism in many gene features. Accordingly, X-linked genes, which have lower sequence polymorphism levels than autosomal genes, also show less expression variation than autosomal genes. We also find that sex-specifically expressed genes show higher local levels of polymorphism and divergence than both sex-biased and unbiased genes, and that they appear to have simpler regulatory regions. Conclusion: The gene-feature-based analyses and the X-to-autosome comparisons suggest that sequence polymorphism in cis-acting elements is an important determinant of expression variation. However, this relationship varies among the different categories of sex-biased expression, and trans factors might contribute more to male-specific gene expression than cis effects. Our analysis of sex-specific gene expression also shows that female-specific genes have been overlooked in analyses that only point to male-biased genes as having unusual patterns of evolution and that studies of sexually dimorphic traits need to recognize that the relationship between genetic and expression variation at these traits is different from the genome as a whole.

Springer - Publisher Connector

Carolina Digital Repository

eScholarship - University of California

Adaptive Gene Expression Divergence Inferred from Population Genomics

Author: Alisha K Holloway
Corbin D Jones
David J Begun
Jason G Mezey
Mara K. N Lawniczak
Trudy F. C Mackay
Publication venue: eScholarship, University of California
Publication date: 01/01/2007

Detailed studies of individual genes have shown that gene expression divergence often results from adaptive evolution of regulatory sequence. Genome-wide analyses, however, have yet to unite patterns of gene expression with polymorphism and divergence to infer population genetic mechanisms underlying expression evolution. Here, we combined genomic expression data, analyzed in a phylogenetic context, with whole genome light-shotgun sequence data from six Drosophila simulans lines and reference sequences from D. melanogaster and D. yakuba. These data allowed us to use molecular population genetics to test for neutral versus adaptive gene expression divergence on a genomic scale. We identified recent and recurrent adaptive evolution along the D. simulans lineage by contrasting sequence polymorphism within D. simulans to divergence from D. melanogaster and D. yakuba. Genes that evolved higher levels of expression in D. simulans have experienced adaptive evolution of the associated 3' flanking and amino acid sequence. Concomitantly, these genes are also decelerating in their rates of protein evolution, which is in agreement with the finding that highly expressed genes evolve slowly. Interestingly, adaptive evolution in 5' cis-regulatory regions did not correspond strongly with expression evolution. Our results provide a genomic view of the intimate link between selection acting on a phenotype and associated genic evolution.

Crossref

Directory of Open Access Journals

Carolina Digital Repository

eScholarship - University of California

A High-Resolution Map of Human Evolutionary Constraint Using 29 Mammals

Author: A Keinan
A Siepel
A Stark
Adam Siepel
Albert J. Vilella
Alexander Stark
Alisha K. Holloway
Andre L. Martins
Brian J. Parker
Brian J. Raney
CB Lowe
Christie L. Kovar
Craig B. Lowe
D Altshuler
D Baek
D Pillas
D Schmidt
David B. Jaffe
David Haussler
Dennis Kostka
Donna M. Muzny
Elaine R. Mardis
Elliott H. Margulies
Eric D. Green
Eric S. Lander
ES Lander
ET Wang
EV Davydov
Evan Mauceli
Ewan Birney
F Chiaromonte
Federica Di Palma
G Bejerano
Genome 10K Community Of Scientists
George M. Weinstock
GM Cooper
Gregory Jordan
Hiram Clawson
Ida Moltke
Irwin Jungreis
J Ernst
J Harrow
JA Drake
Jakob S. Pedersen
James Cuff
Jason Ernst
Javier Herrero
Jean Chang
Jessica Alföldi
Jiayu Wen
Jim Robinson
JS Pedersen
JT Lee
JW Thomas
K Lindblad-Toh
Katherine S. Pollard
Kathryn Beal
KD Pruitt
Kerstin Lindblad-Toh
Kim C. Worley
KS Pollard
Lucas D. Ward
M Clamp
M Garber
M Guttman
M Kellis
Manolis Kellis
Manuel Garber
Marcia Lara
Maria L. Martínez-Chantar
Matthew D. Rasmussen
Melissa J. Hubisz
MF Lin
Michael C. Zody
Michael F. Lin
Michele Clamp
Mitchell Guttman
MJ Hubisz
Nick Goldman
Or Zuk
P Kheradpour
Paul Flicek
Pouya Kheradpour
RA Gibbs
RH Waterston
Richard A. Gibbs
Richard K. Wilson
S Gnerre
S Maenner
S Meader
S Prabhakar
S Tumpel
S Washietl
Sante Gnerre
Stefan Washietl
Stephen Fitzgerald
Tim Massingham
TS Mikkelsen
W. James Kent
Wesley C. Warren
X Lampe
X Xie
Xiaohui Xie
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2011

The comparison of related genomes has emerged as a powerful lens for genome interpretation. Here we report the sequencing and comparative analysis of 29 eutherian genomes. We confirm that at least 5.5% of the human genome has undergone purifying selection, and locate constrained elements covering ~4.2% of the genome. We use evolutionary signatures and comparisons with experimental data sets to suggest candidate functions for ~60% of constrained bases. These elements reveal a small number of new coding exons, candidate stop codon readthrough events and over 10,000 regions of overlapping synonymous constraint within protein-coding exons. We find 220 candidate RNA structural families, and nearly a million elements overlapping potential promoter, enhancer and insulator regions. We report specific amino acid residues that have undergone positive selection, 280,000 non-coding elements exapted from mobile elements and more than 1,000 primate- and human-accelerated elements. Overlap with disease-associated variants indicates that our findings will be relevant for studies of human biology, health and disease.

Funding: National Human Genome Research Institute (U.S.); National Institute of General Medical Sciences (U.S.) (Grant number GM82901); National Science Foundation (U.S.), Postdoctoral Fellowship (Award 0905968); National Science Foundation (U.S.), Career (0644282); National Institutes of Health (U.S.) (R01-HG004037); Alfred P. Sloan Foundation; Austrian Science Fund, Erwin Schrodinger Fellowship

DSpace@MIT

Crossref

Cold Spring Harbor Laboratory Institutional Repository

Copenhagen University Research Information System

PubMed Central

eScholarship - University of California

Swepub

Chromatin remodelling complex dosage modulates transcription factor function in heart development

Author: Alexander Jeffrey M
Bruneau BG
Chambon P
Delgado-Olguín Paul
Harvey RP
Henkelman R Mark
Holloway AD
Holloway Alisha K
Metzger Daniel
Munson Chantilly
Pollard KS
Scott IC
Stainier Didier YR
Sugizaki H
Takeuchi JK
Wylie John N
Yeh Ru-Fang
Zhou Yu-Qing
Zhu Yonghong
Publication venue: eScholarship, University of California
Publication date: 01/01/2011

Dominant mutations in cardiac transcription factor genes cause human inherited congenital heart defects (CHDs); however, their molecular basis is not understood. Interactions between transcription factors and the Brg1/Brm-associated factor (BAF) chromatin remodelling complex suggest potential mechanisms; however, the role of BAF complexes in cardiogenesis is not known. In this study, we show that dosage of Brg1 is critical for mouse and zebrafish cardiogenesis. Disrupting the balance between Brg1 and disease-causing cardiac transcription factors, including Tbx5, Tbx20 and Nkx2-5, causes severe cardiac anomalies, revealing an essential allelic balance between Brg1 and these cardiac transcription factor genes. This suggests that the relative levels of transcription factors and BAF complexes are important for heart development, which is supported by reduced occupancy of Brg1 at cardiac gene promoters in Tbx5 haploinsufficient hearts. Our results reveal complex dosage-sensitive interdependence between transcription factors and BAF complexes, providing a potential mechanism underlying transcription factor haploinsufficiency, with implications for multigenic inheritance of CHDs.

Crossref

eScholarship - University of California

UNSWorks

Diposit Digital de Documents de la UAB

The western painted turtle genome, a model for the evolution of extreme physiological adaptations in a slowly evolving lineage

Author: Abramyan John
Amemiya Chris T
Badenhorst Daleen
Biggar Kyle K
Borchert Glen M
Botka Christopher W
Bowden Rachel M
Bradley Shaffer H
Braun Edward L
Bronikowski Anne M
Bruneau Benoit G
Buck Leslie T
Capel Blanche
Castoe Todd A
Czerwinski Mike
de Koning AP Jason
Delehaunty Kim D
Edwards Scott V
Fronick Catrina C
Fujita Matthew K
Fulton Lucinda
Graves Tina A
Green Richard E
Haerty Wilfried
Hariharan Ramkumar
Hernandez Omar
Hillier LaDeana W
Holloway Alisha K
Janes Daniel
Janzen Fredric J
Kandoth Cyriac
Kong Lesheng
Li Yang
Literman Robert
Mardis Elaine R
McGaugh Suzanne E
Minx Patrick
Mork Lindsey
O'Laughlin Michelle
Paitz Ryan T
Pollock David D
Ponting Chris P
Radhakrishnan Srihari
Raney Brian J
Richman Joy M
Schwartz Tonia
Sethuraman Arun
Shedlock Andrew M
Spinks Phillip Q
St John John
Storey Kenneth B
Thane Nay
Thomson Robert C
Valenzuela Nicole
Vinar Tomas
Warren Daniel E
Warren Wesley C
Wilson Richard K
Zimmerman Laura M
Publication venue: eScholarship, University of California
Publication date: 01/01/2013

Background: We describe the genome of the western painted turtle, Chrysemys picta bellii, one of the most widespread, abundant, and well-studied turtles. We place the genome into a comparative evolutionary context, and focus on genomic features associated with tooth loss, immune function, longevity, sex differentiation and determination, and the species' physiological capacities to withstand extreme anoxia and tissue freezing. Results: Our phylogenetic analyses confirm that turtles are the sister group to living archosaurs, and demonstrate an extraordinarily slow rate of sequence evolution in the painted turtle. The ability of the painted turtle to withstand complete anoxia and partial freezing appears to be associated with common vertebrate gene networks, and we identify candidate genes for future functional analyses. Tooth loss shares a common pattern of pseudogenization and degradation of tooth-specific genes with birds, although the rate of accumulation of mutations is much slower in the painted turtle. Genes associated with sex differentiation generally reflect phylogeny rather than convergence in sex determination functionality. Among gene families that demonstrate exceptional expansions or show signatures of strong natural selection, immune function and musculoskeletal patterning genes are consistently over-represented. Conclusions: Our comparative genomic analyses indicate that common vertebrate regulatory networks, some of which have analogs in human diseases, are often involved in the western painted turtle's extraordinary physiological capacities. As these regulatory pathways are analyzed at the functional level, the painted turtle may offer important insights into the management of a number of human health disorders.

Digital Repository @ Iowa State University (ISU)

Crossref

Springer - Publisher Connector

Harvard University - DASH

PubMed Central

eScholarship - University of California

University of British Columbia: cIRcle - UBC's Information Repository

University of East Anglia digital repository

The 2006 NESCent Phyloinformatics Hackathon: A Field Report

Author: Bala Sendu
Balhoff James P.
Bouck Amy
Goto Naohisa
Holder Mark T.
Holland Richard
Holloway Alisha K.
Katayama Toshiaki
Kosakovsky Pond Sergei L.
Lapp Hilmar
Lewis Paul O.
Mackey Aaron J.
Osborne Brian I.
Piel William H.
Poon Art F. Y.
Qiu Wei-Gang
Stajich Jason E.
Stoltzfus Arlin
Thierer Tobias
Vilella Albert J.
Vision Todd J.
Vos Rutger A.
Zmasek Christian M.
Zwickl Derrick J.
Publication venue: Oxford University Press (OUP)
Publication date: 01/05/2012

In December 2006, a group of 26 software developers from some of the most widely used life science programming toolkits and phylogenetic software projects converged on Durham, North Carolina, for a Phyloinformatics Hackathon, an intense five-day collaborative software coding event sponsored by the National Evolutionary Synthesis Center (NESCent). The goal was to help researchers integrate multiple phylogenetic software tools into automated workflows. Participants addressed deficiencies in interoperability between programs by implementing “glue code” and improving support for phylogenetic data exchange standards (particularly NEXUS) across the toolkits. The work was guided by use-cases compiled in advance by both developers and users, and the code was documented as it was developed. The resulting software is freely available for both users and developers through incorporation into the distributions of several widely used open-source toolkits. We explain the motivation for the hackathon and how it was organized, and discuss some of the outcomes and lessons learned. We conclude that hackathons are an effective mode of solving problems in software interoperability and usability, and are underutilized in scientific software development.

KU ScholarWorks (Univ. of Kansas)