Search CORE

452 research outputs found

The erratic mitochondrial clock: variations of mutation rate, not population size, affect mtDNA diversity across birds and mammals

Author: Galtier Nicolas
Glémin Sylvain
Nabholz Benoit
Publication venue: BioMed Central
Publication date: 01/01/2009
Field of study

Abstract Background During the last ten years, major advances have been made in characterizing and understanding the evolution of mitochondrial DNA, the most popular marker of molecular biodiversity. Several important results were recently reported using mammals as model organisms, including (i) the absence of relationship between mitochondrial DNA diversity and life-history or ecological variables, (ii) the absence of prominent adaptive selection, contrary to what was found in invertebrates, and (iii) the unexpectedly large variation in neutral substitution rate among lineages, revealing a possible link with species maximal longevity. We propose to challenge these results thanks to the bird/mammal comparison. Direct estimates of population size are available in birds, and this group presents striking life-history trait differences with mammals (higher mass-specific metabolic rate and longevity). These properties make birds the ideal model to directly test for population size effects, and to discriminate between competing hypotheses about the causes of substitution rate variation. Results A phylogenetic analysis of cytochrome <it>b </it>third-codon position confirms that the mitochondrial DNA mutation rate is quite variable in birds, passerines being the fastest evolving order. On average, mitochondrial DNA evolves slower in birds than in mammals of similar body size. This result is in agreement with the longevity hypothesis, and contradicts the hypothesis of a metabolic rate-dependent mutation rate. Birds show no footprint of adaptive selection on cytochrome <it>b </it>evolutionary patterns, but no link between direct estimates of population size and cytochrome <it>b </it>diversity. The mutation rate is the best predictor we have of within-species mitochondrial diversity in birds. It partly explains the differences in mitochondrial DNA diversity patterns observed between mammals and birds, previously interpreted as reflecting Hill-Robertson interferences with the W chromosome. Conclusion Mitochondrial DNA diversity patterns in birds are strongly influenced by the wide, unexpected variation of mutation rate across species. From a fundamental point of view, these results are strongly consistent with a relationship between species maximal longevity and mitochondrial mutation rate, in agreement with the mitochondrial theory of ageing. Form an applied point of view, this study reinforces and extends the message of caution previously expressed for mammals: mitochondrial data tell nothing about species population sizes, and strongly depart the molecular clock assumption.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Bio++: a set of C++ libraries for sequence analysis, phylogenetics, molecular evolution and population genetics

Author: Bazin Eric
Belkhir Khalid
Dutheil Julien
Gaillard Sylvain
Galtier Nicolas
Glémin Sylvain
Ranwez Vincent
Publication venue: BioMed Central
Publication date: 01/01/2006
Field of study

BACKGROUND: A large number of bioinformatics applications in the fields of bio-sequence analysis, molecular evolution and population genetics typically share input/ouput methods, data storage requirements and data analysis algorithms. Such common features may be conveniently bundled into re-usable libraries, which enable the rapid development of new methods and robust applications. RESULTS: We present Bio++, a set of Object Oriented libraries written in C++. Available components include classes for data storage and handling (nucleotide/amino-acid/codon sequences, trees, distance matrices, population genetics datasets), various input/output formats, basic sequence manipulation (concatenation, transcription, translation, etc.), phylogenetic analysis (maximum parsimony, markov models, distance methods, likelihood computation and maximization), population genetics/genomics (diversity statistics, neutrality tests, various multi-locus analyses) and various algorithms for numerical calculus. CONCLUSION: Implementation of methods aims at being both efficient and user-friendly. A special concern was given to the library design to enable easy extension and new methods development. We defined a general hierarchy of classes that allow the developer to implement its own algorithms while remaining compatible with the rest of the libraries. Bio++ source code is distributed free of charge under the CeCILL general public licence from its website

Springer - Publisher Connector

Directory of Open Access Journals

Gene expression drives the evolution of dominance.

Author: A Durvasula
A Platt
AF Agrawal
B Charlesworth
BM Henn
BY Kim
CD Huber
CD Huber
D Enard
D Ortega-Del Vecchyo
D Szklarczyk
DJ Balick
F Gao
F Manna
FH Shaw
H Kacser
HA Orr
I Frumkin
J Yang
JBS Haldane
JS Sanjak
KE Lohmueller
KM Teshima
LD Hurst
MA DePristo
MJ Simmons
N Phadnis
P Cingolani
P Lamesch
PY Novikova
RA Fisher
RD Hernandez
RN Gutenkunst
S Glémin
S Ossowski
S Williamson
S Wright
SH Williamson
T Bedford
T Kawakatsu
T Mukai
TI Gossmann
TT Hu
X Zheng
YB Simons
Publication venue: eScholarship, University of California
Publication date: 01/01/2018
Field of study

Dominance is a fundamental concept in molecular genetics and has implications for understanding patterns of genetic variation, evolution, and complex traits. However, despite its importance, the degree of dominance in natural populations is poorly quantified. Here, we leverage multiple mating systems in natural populations of Arabidopsis to co-estimate the distribution of fitness effects and dominance coefficients of new amino acid changing mutations. We find that more deleterious mutations are more likely to be recessive than less deleterious mutations. Further, this pattern holds across gene categories, but varies with the connectivity and expression patterns of genes. Our work argues that dominance arises as a consequence of the functional importance of genes and their optimal expression levels

Crossref

eScholarship - University of California

Genomic and proteomic biases inform metabolic engineering strategies for anaerobic fungi.

Author: Albà
Arazoe
Atasoglu
Bach
Beckham
Bezanson
Birdsell
Boch
Bonugli-Santos
Brownlee
Calkins
Camacho
Camiolo
Carlson
Chan
Chen
Cheng
Cheng
Chokhawala
Coker
Deshpande
Diener
Dollhofer
Duarte
Durand
Duret
Fondon
Galtier
Gasiunas
Gentzsch
Gerngross
Glass
Glémin
Greene
Grigoriev
Haitjema
Haitjema
Hamilton
Hanafy
Hartfield
Henske
Hershberg
Hildebrand
Hull
Jiang
Karlin
Kiktev
Kleinstiver
Kleinstiver
Knauer
Knight
Kuyper
Leberer
Li
Liggenstoffer
Liu
Magee
Mertens
Meunier
Morrison
Murphy
Nicholson
Nieuwenhuis
Nørholm
Orpin
Oyola
O’Malley
Podolsky
Raymond
Reichenberger
Ropars
Sadhu
Sammond
Sekowska
Seppälä
Seppälä
Solieri
Solomon
Sonan
Staben
Steensels
Steensels
Sukumaran
Theodorou
UniProt: a worldwide hub of protein knowledge
Videvall
Wang
Wang
Wright
Wu
Ximenes
Youssef
Zetsche
Publication venue: eScholarship, University of California
Publication date: 01/06/2020
Field of study

Anaerobic fungi (Neocallimastigomycota) are emerging non-model hosts for biotechnology due to their wealth of biomass-degrading enzymes, yet tools to engineer these fungi have not yet been established. Here, we show that the anaerobic gut fungi have the most GC depleted genomes among 443 sequenced organisms in the fungal kingdom, which has ramifications for heterologous expression of genes as well as for emerging CRISPR-based genome engineering approaches. Comparative genomic analyses suggest that anaerobic fungi may contain cellular machinery to aid in sexual reproduction, yet a complete mating pathway was not identified. Predicted proteomes of the anaerobic fungi also contain an unusually large fraction of proteins with homopolymeric amino acid runs consisting of five or more identical consecutive amino acids. In particular, threonine runs are especially enriched in anaerobic fungal carbohydrate active enzymes (CAZymes) and this, together with a high abundance of predicted N-glycosylation motifs, suggests that gut fungal CAZymes are heavily glycosylated, which may impact heterologous production of these biotechnologically useful enzymes. Finally, we present a codon optimization strategy to aid in the development of genetic engineering tools tailored to these early-branching anaerobic fungi

Crossref

eScholarship - University of California

A Model-Based Analysis of GC-Biased Gene Conversion in the Human and Chimpanzee Genomes

Author: A Auton
A Kong
A Navarro
A Necşulea
A Ratnakumar
A Siepel
Adam Siepel
AJ Jeffreys
AJ Webb
AP Boyle
BC Lamb
C Kosiol
CC Spencer
CF Mugal
D Karolchik
D Kostka
Dennis Kostka
E Mancera
G Marais
Graham Coop
J Berglund
J Harrow
J Romiguier
JA Capra
JM Chen
John A. Capra
JW IJdo
K Lindblad-Toh
K Pollard
Katherine S. Pollard
L Arbiza
L Duret
L Duret
LR Meyer
M Blanchette
M Hasegawa
Melissa J. Hubisz
MJ Hubisz
N Galtier
N Galtier
N Lartillot
P Flicek
P Stenson
RD George
S Glémin
S Katzman
S Katzman
S Myers
S Myers
SE Ptak
ST Sherry
T Nagylaki
TC Brown
TR Dreszer
W Winckler
Y Zhang
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2013
Field of study

GC-biased gene conversion (gBGC) is a recombination-associated process that favors the fixation of G/C alleles over A/T alleles. In mammals, gBGC is hypothesized to contribute to variation in GC content, rapidly evolving sequences, and the fixation of deleterious mutations, but its prevalence and general functional consequences remain poorly understood. gBGC is difficult to incorporate into models of molecular evolution and so far has primarily been studied using summary statistics from genomic comparisons. Here, we introduce a new probabilistic model that captures the joint effects of natural selection and gBGC on nucleotide substitution patterns, while allowing for correlations along the genome in these effects. We implemented our model in a computer program, called phastBias, that can accurately detect gBGC tracts about 1 kilobase or longer in simulated sequence alignments. When applied to real primate genome sequences, phastBias predicts gBGC tracts that cover roughly 0.3% of the human and chimpanzee genomes and account for 1.2% of human-chimpanzee nucleotide differences. These tracts fall in clusters, particularly in subtelomeric regions; they are enriched for recombination hotspots and fast-evolving sequences; and they display an ongoing fixation preference for G and C alleles. They are also significantly enriched for disease-associated polymorphisms, suggesting that they contribute to the fixation of deleterious alleles. The gBGC tracts provide a unique window into historical recombination processes along the human and chimpanzee lineages. They supply additional evidence of long-term conservation of megabase-scale recombination rates accompanied by rapid turnover of hotspots. Together, these findings shed new light on the evolutionary, functional, and disease implications of gBGC. The phastBias program and our predicted tracts are freely available. © 2013 Capra et al

arXiv.org e-Print Archive

Crossref

Cold Spring Harbor Laboratory Institutional Repository

Directory of Open Access Journals

PubMed Central

eScholarship - University of California

D-Scholarship@Pitt

FigShare

Substitution Patterns Are GC-Biased in Divergent Sequences across the Metazoans

Author: Berglund
Birdsell
Blanchette
Charlesworth
Clément
Coop
Cox
Dreszer
Duret
Duret
Eyre-Walker
Eyre-Walker
Fiston-Lavier
Fullerton
Galtier
Galtier
Glémin
Glémin
Groenen
Harrison
Hernandez
Hubisz
Hunter
Hurst
International Chicken Genome Sequencing Consortium
Jackson Laboratories
John A. Capra
Jones
Karolchik
Katherine S. Pollard
Katzman
Kent
Kent
Kong
Kuraku
Lynch
Mancera
Marais
Marais
Marais
Meunier
Oliver
Pollard
Pollard
Pollard
Prabhakar
R Development Core Team
Ratnakumar
Romiguier
Sherry
Shifman
Siepel
Siepel
Smit
The International Hapmap Consortium
Tsai
Tsai
Webster
Webster
Publication venue: Oxford University Press
Publication date: 01/01/2011
Field of study

The fastest-evolving regions in the human and chimpanzee genomes show a remarkable excess of weak (A,T) to strong (G,C) nucleotide substitutions since divergence from their common ancestor. We investigated the phylogenetic extent and possible causes of this weak to strong (W→S) bias in divergent sequences (BDS) using recently sequenced genomes and recombination maps from eight trios of eukaryotic species. To quantify evidence for BDS, we inferred substitution histories using an efficient maximum likelihood approach with a context-dependent evolutionary model. We then annotated all lineage-specific substitutions in terms of W→S bias and density on the chromosomes. Finally, we used the inferred substitutions to calculate a BDS score—a log odds ratio between substitution type and density—and assessed its statistical significance with Fisher's exact test. Applying this approach, we found significant BDS in the coding and noncoding sequence of human, mouse, dog, stickleback, fruit fly, and worm. We also observed a significant lack of W→S BDS in chicken and yeast. The BDS score varies between species and across the chromosomes within each species. It is most strongly correlated with different genomic features in different species, but a strong correlation with recombination rates is found in several species. Our results demonstrate that a W→S substitution bias in fast-evolving sequences is a widespread phenomenon. The patterns of BDS observed suggest that a recombination-associated process, such as GC-biased gene conversion, is involved in the production of the bias in many species, but the strength of the BDS likely depends on many factors, including genome stability, variability in recombination rate over time and across the genome, the frequency of meiosis, and the amount of outcrossing in each species

Crossref

PubMed Central

eScholarship - University of California

Evidence for strong fixation bias at 4-fold degenerate sites across genes in the great tit genome

Author: Akaike
Bernardi
Borges
Botero-Castro
Boĺivar
Corcoran
De Maio
Dreszer
Duret
Duret
Dutheil
Ellegren
Ellegren
Eyre-Walker
Felsenstein
Fletcher
Galtier
Glémin
Glémin
Gossmann
Guéguen
Hasegawa
Hillier
Hron
Hudson
Jayaswal
Jukes
Kawakami
Kimura
Kimura
Laine
Laine
Lartillot
Lovell
Matsumoto
McDonald
Nagylaki
Pouyet
Pouyet
Romanov
Romiguier
Scornavacca
Singhal
Smith
Warren
Weber
Yang
Yang
Zhang
Publication venue: 'Frontiers Media SA'
Publication date: 01/01/2018
Field of study

It is well established that GC content varies across the genome in many species and that GC biased gene conversion, one form of meiotic recombination, is likely to contribute to this heterogeneity. Bird genomes provide an extraordinary system to study the impact of GC biased gene conversion owed to their specific genomic features. They are characterized by a high karyotype conservation with substantial heterogeneity in chromosome sizes, with up to a dozen large macrochromosomes and many smaller microchromosomes common across all bird species. This heterogeneity in chromosome morphology is also reflected by other genomic features, such as smaller chromosomes being gene denser, more compact and more GC rich relative to their macrochromosomal counterparts - illustrating that the intensity of GC biased gene conversion varies across the genome. Here we study whether it is possible to infer heterogeneity in GC biased gene conversion rates across the genome using a recently published method that accounts for GC biased gene conversion when estimating branch lengths in a phylogenetic context. To infer the strength of GC biased gene conversion we contrast branch length estimates across the genome both taking and not taking non-stationary GC composition into account. Using simulations we show that this approach works well when GC fixation bias is strong and note that the number of substitutions along a branch is consistently overestimated when GC biased gene conversion is not accounted for. We use this predictable feature to infer the strength of GC dynamics across the great tit genome by applying our new pipeline to data at 4-fold degenerate sites from three bird species-great tit, zebra finch and chicken-three species that are among the best annotated bird genomes to date. We show that using a simple one-dimensional binning we fail to capture a signal of fixation bias as observed in our simulations. However, using a multidimensional binning strategy, we find evidence for heterogeneity in the strength of fixation bias, including AT fixation bias. This highlights the difficulties when combining sequence data across different regions in the genome

Crossref

Directory of Open Access Journals

Frontiers - Publisher Connector

Munin - Open Research Archive

NORA - Norwegian Open Research Archives

White Rose Research Online

New methods for inferring the distribution of fitness effects for INDELs and SNPs

Author: Ananda
Andolfatto
Andolfatto
Besenbacher
Blanchette
Bustamante
Corcoran
DePristo
Earl
Eyre-Walker
Eyre-Walker
Eyre-Walker
Galtier
Glémin
Haag-Liautard
Harris
Hartfield
Henry J Barton
Hernandez
Hu
Jackson
Jackson
John Parsch
Kai Zeng
Keightley
Keightley
Keightley
Kent
Kim
Kousathanas
Kvikstad
Leushkin
Leushkin
Li
Li
Montgomery
Muyle
Myers
Parsch
Parsch
Petrov
Petrov
Pool
Ptak
Sawyer
Schneider
Schrider
Tajima
Tajima
Tataru
Van der Auwera
Watterson
Yang
Yang
Zuk
Publication venue: 'Oxford University Press (OUP)'
Publication date: 04/04/2018
Field of study

Small insertions and deletions (INDELs; ≤50bp) are the most common type of variability after SNPs. However, compared to SNPs, we know little about the distribution of fitness effects (DFE) of new INDEL mutations and how prevalent adaptive INDEL substitutions are. Studying INDELs has been difficult partly because identifying ancestral states at these sites is error-prone and misidentification can lead to severely biased estimates of the strength of selection. To solve these problems, we develop new maximum likelihood methods, which use polymorphism data to simultaneously estimate the DFE, the mutation rate, and the misidentification rate. These methods are applicable to both INDELs and SNPs. Simulations show that they can provide highly accurate results. We applied the methods to an INDEL polymorphism dataset in Drosophila melanogaster. We found that the DFE for polymorphic INDELs in protein-coding regions is bimodal, with the variants being either nearly neutral or strongly deleterious. Based on the DFE, we estimated that 71.5% - 83.7% of the INDEL substitutions that took place along the D. melanogaster lineage were fixed by positive selection, which is comparable to the prevalence of adaptive substitutions at non-synonymous sites. The new methods have been implemented in the software package anavar

Crossref

White Rose Research Online

Population structure and genetic bottleneck in sweet cherry estimated with SSRs and the gametophytic self-incompatibility locus

Abstract Background Domestication and breeding involve the selection of particular phenotypes, limiting the genomic diversity of the population and creating a bottleneck. These effects can be precisely estimated when the location of domestication is established. Few analyses have focused on understanding the genetic consequences of domestication and breeding in fruit trees. In this study, we aimed to analyse genetic structure and changes in the diversity in sweet cherry <it>Prunus avium </it>L. Results Three subgroups were detected in sweet cherry, with one group of landraces genetically very close to the analysed wild cherry population. A limited number of SSR markers displayed deviations from the frequencies expected under neutrality. After the removal of these markers from the analysis, a very limited bottleneck was detected between wild cherries and sweet cherry landraces, with a much more pronounced bottleneck between sweet cherry landraces and modern sweet cherry varieties. The loss of diversity between wild cherries and sweet cherry landraces at the <it>S</it>-locus was more significant than that for microsatellites. Particularly high levels of differentiation were observed for some <it>S</it>-alleles. Conclusions Several domestication events may have happened in sweet cherry or/and intense gene flow from local wild cherry was probably maintained along the evolutionary history of the species. A marked bottleneck due to breeding was detected, with all markers, in the modern sweet cherry gene pool. The microsatellites did not detect the bottleneck due to domestication in the analysed sample. The vegetative propagation specific to some fruit trees may account for the differences in diversity observed at the <it>S</it>-locus. Our study provides insights into domestication events of cherry, however, requires confirmation on a larger sampling scheme for both sweet cherry landraces and wild cherry.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals