Search CORE

77 research outputs found

Screening synteny blocks in pairwise genome comparisons through integer programming

Author: Andrew H Paterson
BJ Haas
Brent Pedersen
C Simillion
C Simillion
C Soderlund
E Lyons
E Lyons
Eric Lyons
G Tesler
H Tang
H Tang
Haibao Tang
HW Six
James C Schnable
JE Bowers
JM Aury
JM Catchen
K Yogeeswaran
L Cui
M Kellis
Michael Freeling
O Jaillon
O Jaillon
P Pevzner
Q Peng
R Warren
RM Karp
S Schwartz
SF Altschul
W Miller
WJ Kent
X Wang
Y Van de Peer
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background It is difficult to accurately interpret chromosomal correspondences such as true orthology and paralogy due to significant divergence of genomes from a common ancestor. Analyses are particularly problematic among lineages that have repeatedly experienced whole genome duplication (WGD) events. To compare multiple "subgenomes" derived from genome duplications, we need to relax the traditional requirements of "one-to-one" syntenic matchings of genomic regions in order to reflect "one-to-many" or more generally "many-to-many" matchings. However this relaxation may result in the identification of synteny blocks that are derived from ancient shared WGDs that are not of interest. For many downstream analyses, we need to eliminate weak, low scoring alignments from pairwise genome comparisons. Our goal is to objectively select subset of synteny blocks whose total scores are maximized while respecting the duplication history of the genomes in comparison. We call this "quota-based" screening of synteny blocks in order to appropriately fill a quota of syntenic relationships within one genome or between two genomes having WGD events. Results We have formulated the synteny block screening as an optimization problem known as "Binary Integer Programming" (BIP), which is solved using existing linear programming solvers. The computer program QUOTA-ALIGN performs this task by creating a clear objective function that maximizes the compatible set of synteny blocks under given constraints on overlaps and depths (corresponding to the duplication history in respective genomes). Such a procedure is useful for any pairwise synteny alignments, but is most useful in lineages affected by multiple WGDs, like plants or fish lineages. For example, there should be a 1:2 ploidy relationship between genome A and B if genome B had an independent WGD subsequent to the divergence of the two genomes. We show through simulations and real examples using plant genomes in the rosid superorder that the quota-based screening can eliminate ambiguous synteny blocks and focus on specific genomic evolutionary events, like the divergence of lineages (in cross-species comparisons) and the most recent WGD (in self comparisons). Conclusions The QUOTA-ALIGN algorithm screens a set of synteny blocks to retain only those compatible with a user specified ploidy relationship between two genomes. These blocks, in turn, may be used for additional downstream analyses such as identifying true orthologous regions in interspecific comparisons. There are two major contributions of QUOTA-ALIGN: 1) reducing the block screening task to a BIP problem, which is novel; 2) providing an efficient software pipeline starting from all-against-all BLAST to the screened synteny blocks with dot plot visualizations. Python codes and full documentations are publicly available <url>http://github.com/tanghaibao/quota-alignment</url>. QUOTA-ALIGN program is also integrated as a major component in SynMap <url>http://genomevolution.com/CoGe/SynMap.pl</url>, offering easier access to thousands of genomes for non-programmers.</p

Crossref

DigitalCommons@University of Nebraska

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

The University of Arizona

eScholarship - University of California

Accuracy and quality assessment of 454 GS-FLX Titanium pyrosequencing

Author: André Gilles
AR Quinlan
Christopher W. Wheat
D Hamilton
DA Hahn
Emese Meglécz
F Saeed
IW Saunders
J. C. Dohm
Jean-François Martin
JM Aury
JS Reis-Filho
KJ Hoff
KM Wegner
M Lynch
M Lynch
M Margulies
MA Larkin
Maxime Galan
Nicolas Pech
P McCullagh
PJ Campbell
SF Altschul
SM Huse
Steve Hoffmann
Stéphanie Ferreira
Susan M Huse
Sverker Lundin
Thibaut Malausa
V Kunin
W Babik
XiaoGuang Zhou
Y Benjamini
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background The rapid evolution of 454 GS-FLX sequencing technology has not been accompanied by a reassessment of the quality and accuracy of the sequences obtained. Current strategies for decision-making and error-correction are based on an initial analysis by Huse <it>et al. </it>in 2007, for the older GS20 system based on experimental sequences. We analyze here the quality of 454 sequencing data and identify factors playing a role in sequencing error, through the use of an extensive dataset for Roche control DNA fragments. Results We obtained a mean error rate for 454 sequences of 1.07%. More importantly, the error rate is not randomly distributed; it occasionally rose to more than 50% in certain positions, and its distribution was linked to several experimental variables. The main factors related to error are the presence of homopolymers, position in the sequence, size of the sequence and spatial localization in PT plates for insertion and deletion errors. These factors can be described by considering seven variables. No single variable can account for the error rate distribution, but most of the variation is explained by the combination of all seven variables. Conclusions The pattern identified here calls for the use of internal controls and error-correcting base callers, to correct for errors, when available (e.g. when sequencing amplicons). For shotgun libraries, the use of both sequencing primers and deep coverage, combined with the use of random sequencing primer sites should partly compensate for even high error rates, although it may prove more difficult than previous thought to distinguish between low-frequency alleles and errors.</p

Crossref

Springer - Publisher Connector

HAL AMU

Directory of Open Access Journals

Elusive Origins of the Extra Genes in Aspergillus oryzae

Author: AH Paterson
C Hall
C Simillion
DM Geiser
F Delsuc
G Ricard
GC Conant
H Nishida
J Castresana
J Felsenstein
J Kamper
JD Thompson
JE Galagan
JE Nixon
JJ Cai
JM Aury
JM Lee
JO Andersson
K Tamano
Kenneth H. Wolfe
KH Wolfe
KP Byrne
M Kellis
M Lynch
M Lynch
M Machida
MA Fares
Nora Khaldi
NU Frigaard
O Jaillon
P Dehal
PM Sharp
RB Langkjaer
S Garcia-Vallve
S Guindon
Sudhindra Gadagkar
T Hamada
T Kobayashi
WC Nierman
Y van de Peer
Y van de Peer
Z Yang
Z Yang
Publication venue: Public Library of Science
Publication date: 01/01/2008
Field of study

The genome sequence of Aspergillus oryzae revealed unexpectedly that this species has approximately 20% more genes than its congeneric species A. nidulans and A. fumigatus. Where did these extra genes come from? Here, we evaluate several possible causes of the elevated gene number. Many gene families are expanded in A. oryzae relative to A. nidulans and A. fumigatus, but we find no evidence of ancient whole-genome duplication or other segmental duplications, either in A. oryzae or in the common ancestor of the genus Aspergillus. We show that the presence of divergent pairs of paralogs is a feature peculiar to A. oryzae and is not shared with A. nidulans or A. fumigatus. In phylogenetic trees that include paralog pairs from A. oryzae, we frequently find that one of the genes in a pair from A. oryzae has the expected orthologous relationship with A. nidulans, A. fumigatus and other species in the subphylum Eurotiomycetes, whereas the other A. oryzae gene falls outside this clade but still within the Ascomycota. We identified 456 such gene pairs in A. oryzae. Further phylogenetic analysis did not however indicate a single consistent evolutionary origin for the divergent members of these pairs. Approximately one-third of them showed phylogenies that are suggestive of horizontal gene transfer (HGT) from Sordariomycete species, and these genes are closer together in the A. oryzae genome than expected by chance, but no unique Sordariomycete donor species was identifiable. The postulated HGTs from Sordariomycetes still leave the majority of extra A. oryzae genes unaccounted for. One possible explanation for our observations is that A. oryzae might have been the recipient of many separate HGT events from diverse donors

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

Irish Universities

PubMed Central

The mitochondrial genome sequence of the ciliate Paramecium caudatum reveals a shift in nucleotide composition and codon usage within the genus Paramecium

Author: A Tatarenkov
AE Pritchard
C Chantangsi
CE Popescu
CF Brunk
CM Strüder-Kypke
D Barth
D Barth
Dana Barth
DH Lunt
DR Smith
E Przybos
E Przybos
E Przybos
EL Clare
F Catania
F Wright
FN Martin
G Bernardi
G Burger
G Marais
H Jiang
HMW Salim
JA Eisen
JM Aury
K Tamura
KJ Verstrepen
M Berriman
MM Moradian
MW Gray
N Galtier
N Goldman
P Bois
P Jordan
P Stothard
RM De Graaf
S Karlin
SF Altschul
SI Fokin
TA Hall
Thomas U Berendonk
TM Lowe
UV Wesley
X Chen
XJ Min
Y Tsukii
Z Antal
Z Yang
Z Zhang
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background Despite the fact that the organization of the ciliate mitochondrial genome is exceptional, only few ciliate mitochondrial genomes have been sequenced until today. All ciliate mitochondrial genomes are linear. They are 40 kb to 47 kb long and contain some 50 tightly packed genes without introns. Earlier studies documented that the mitochondrial guanine + cytosine contents are very different between <it>Paramecium tetraurelia </it>and all studied <it>Tetrahymena </it>species. This raises the question of whether the high mitochondrial G+C content observed in <it>P. tetraurelia </it>is a characteristic property of <it>Paramecium </it>mtDNA, or whether it is an exception of the ciliate mitochondrial genomes known so far. To test this question, we determined the mitochondrial genome sequence of <it>Paramecium caudatum </it>and compared the gene content and sequence properties to the closely related <it>P. tetraurelia</it>. Results The guanine + cytosine content of the <it>P. caudatum </it>mitochondrial genome was significantly lower than that of <it>P. tetraurelia </it>(22.4% vs. 41.2%). This difference in the mitochondrial nucleotide composition was accompanied by significantly different codon usage patterns in both species, i.e. within <it>P. caudatum </it>clearly A/T ending codons dominated, whereas for <it>P. tetraurelia </it>the synonymous codons were more balanced with a higher number of G/C ending codons. Further analyses indicated that the nucleotide composition of most members of the genus <it>Paramecium </it>resembles that of <it>P. caudatum </it>and that the shift observed in <it>P. tetraurelia </it>is restricted to the <it>P. aurelia </it>species complex. Conclusions Surprisingly, the codon usage bias in the <it>P. caudatum </it>mitochondrial genome, exemplified by the effective number of codons, is more similar to the distantly related <it>T. pyriformis </it>and other single-celled eukaryotes such as <it>Chlamydomonas</it>, than to the closely related <it>P. tetraurelia</it>. These differences in base composition and codon usage bias were, however, not reflected in the amino acid composition. Most probably, the observed picture is best explained by a hitherto unknown (neutral or adaptive) mechanism that increased the guanine + cytosine content in <it>P. tetraurelia </it>mtDNA on the one hand, and strong purifying selection on the ancestral amino acid composition on the other hand. These contradicting forces are counterbalanced by a considerably altered codon usage pattern.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Analysis of the P. lividus sea urchin genome highlights contrasting trends of genomic and regulatory evolution in deuterostomes

Author: Anello L
Arnone MI
Aury JM
Barbe V
Ben Tabou de Leon S
Besnardeau L
Borra M
Cavalieri V
Chessel A
Copley RR
Cormier P
Costa C
Couloux A
Croce J
Da Silva C
Di Bernardo M
Di Carlo M
Dru P
Exposito JY
Gache C
Gavriouchkina D
Geneviève AM
Labadie K
Le Gras S
Lepage T
Lhomond G
Lowe EK
Mangenot S
Marlétaz F
Martinez P
Matranga V
Molina MD
Morales J
Nicosia A
Noel B
Oliveri P
Pascual M
Pegueroles C
Poulain J
Poustka AJ
Ragusa MA
Russo R
Turon X
Wincker P
Ye T
Zito F
Publication venue: 'Elsevier BV'
Publication date: 12/04/2023
Field of study

Sea urchins are emblematic models in developmental biology and display several characteristics that set them apart from other deuterostomes. To uncover the genomic cues that may underlie these specificities, we generated a chromosome-scale genome assembly for the sea urchin Paracentrotus lividus and an extensive gene expression and epigenetic profiles of its embryonic development. We found that, unlike vertebrates, sea urchins retained ancestral chromosomal linkages but underwent very fast intrachromosomal gene order mixing. We identified a burst of gene duplication in the echinoid lineage and showed that some of these expanded genes have been recruited in novel structures (water vascular system, Aristotle's lantern, and skeletogenic micromere lineage). Finally, we identified gene-regulatory modules conserved between sea urchins and chordates. Our results suggest that gene-regulatory networks controlling development can be conserved despite extensive gene order rearrangement

UCL Discovery

The Origin of GPCRs: Identification of Mammalian like Rhodopsin, Adhesion, Glutamate and Frizzled GPCRs in Fungi

Author: A Lafon
Arunkumar Krishnan
C Xue
Chaoyang Xue
DH O'Day
DM Morens
DM Rosenbaum
DW Warnock
EJ Byrnes 3rd
EV Armbrust
F Ronquist
F Silveira
Helgi B. Schiöth
I Ruiz-Trillo
I Ruiz-Trillo
JA Eisen
JH Yu
JM Aury
JR Xu
K Katoh
K Palczewski
KJ Nordstrom
KJ Nordstrom
KJ Nordstrom
L Eichinger
L Kall
L Li
M Carr
M Medina
M Rask-Andersen
M Srivastava
Markus Sällman Almén
MC Lagerstrom
MS Almen
N Kamesh
PS Klein
R Fredriksson
R Fredriksson
RD Finn
RD Kulkarni
Robert Fredriksson
S Guindon
SR Eddy
T Brody
TK Attwood
TK Bjarnadottir
TY James
W Li
W Meersseman
Y Wang
Publication venue: Public Library of Science
Publication date: 01/01/2012
Field of study

G protein-coupled receptors (GPCRs) in humans are classified into the five main families named Glutamate, Rhodopsin, Adhesion, Frizzled and Secretin according to the GRAFS classification. Previous results show that these mammalian GRAFS families are well represented in the Metazoan lineages, but they have not been shown to be present in Fungi. Here, we systematically mined 79 fungal genomes and provide the first evidence that four of the five main mammalian families of GPCRs, namely Rhodopsin, Adhesion, Glutamate and Frizzled, are present in Fungi and found 142 novel sequences between them. Significantly, we provide strong evidence that the Rhodopsin family emerged from the cAMP receptor family in an event close to the split of Opisthokonts and not in Placozoa, as earlier assumed. The Rhodopsin family then expanded greatly in Metazoans while the cAMP receptor family is found in 3 invertebrate species and lost in the vertebrates. We estimate that the Adhesion and Frizzled families evolved before the split of Unikonts from a common ancestor of all major eukaryotic lineages. Also, the study highlights that the fungal Adhesion receptors do not have N-terminal domains whereas the fungal Glutamate receptors have a broad repertoire of mammalian-like N-terminal domains. Further, mining of the close unicellular relatives of the Metazoan lineage, Salpingoeca rosetta and Capsaspora owczarzaki, obtained a rich group of both the Adhesion and Glutamate families, which in particular provided insight to the early emergence of the N-terminal domains of the Adhesion family. We identified 619 Fungi specific GPCRs across 79 genomes and revealed that Blastocladiomycota and Chytridiomycota phylum have Metazoan-like GPCRs rather than the GPCRs specific for Fungi. Overall, this study provides the first evidence of the presence of four of the five main GRAFS families in Fungi and clarifies the early evolutionary history of the GPCR superfamily

Public Library of Science (PLOS)

Crossref

Publikationer från Uppsala Universitet

Directory of Open Access Journals

PubMed Central

Digitala Vetenskapliga Arkivet - Academic Archive On-line

FigShare

Ralstonia syzygii, the Blood Disease Bacterium and Some Asian R. solanacearum Strains Form a Single Genomic Species Despite Divergent Lifestyles

Author: A Guidot
A Guidot
A Kelman
A Mackie
A Mira
AC Hayward
AC Hayward
AH Purcell
Aurélie Lajus
B Coupat
B Remenant
Benoît Remenant
C Luo
CA Boucher
Caitilyn Allen
Ching-Hong Yang
Claudine Medigue
CPA Bennett
CPA Bennett
D Costechareyre
D Gordon
D Vallenet
David Vallenet
DG Brown
DG Brown
DJ Studholme
DW Gabriel
E Gäumann
E Stackebrandt
F Bertolla
G Cellier
Gilles Cellier
H Brussow
H Song
I Pieretti
J Colburn-Clifford
J Goris
J Vasse
J Yao
J-C Cho
Jean-Charles de Cambiaire
JG Lawrence
JM Aury
JM Dow
JM Jacobs
Jonathan M. Jacobs
KT Konstantinidis
M Fegan
M Fegan
M McClelland
M Salanoubat
M Taghavi
M Vaneechoutte
MA Schembri
Mark Fegan
MJ Pallen
MR Gillings
N Killiny
N Killiny
NA Moran
P Kuhnert
P Prior
P Stothard
Philippe Prior
R Caserta
RH Stover
RL Rich
S Cunnac
S Eden-Green
S Eden-Green
S Kurtz
S Subandiyah
SJ Eden-Green
SJ Roberts
Sophie Mangenot
ST Cole
Supriadi
T De Baere
TM Voegel
V Aritua
V Grimault
V Ndungo
Valérie Barbe
W Tushemereirwe
YM Lin
Publication venue: Public Library of Science
Publication date: 08/09/2011
Field of study

The Ralstonia solanacearum species complex includes R. solanacearum, R. syzygii, and the Blood Disease Bacterium (BDB). All colonize plant xylem vessels and cause wilt diseases, but with significant biological differences. R. solanacearum is a soilborne bacterium that infects the roots of a broad range of plants. R. syzygii causes Sumatra disease of clove trees and is actively transmitted by cercopoid insects. BDB is also pathogenic to a single host, banana, and is transmitted by pollinating insects. Sequencing and DNA-DNA hybridization studies indicated that despite their phenotypic differences, these three plant pathogens are actually very closely related, falling into the Phylotype IV subgroup of the R. solanacearum species complex. To better understand the relationships among these bacteria, we sequenced and annotated the genomes of R. syzygii strain R24 and BDB strain R229. These genomes were compared to strain PSI07, a closely related Phylotype IV tomato isolate of R. solanacearum, and to five additional R. solanacearum genomes. Whole-genome comparisons confirmed previous phylogenetic results: the three phylotype IV strains share more and larger syntenic regions with each other than with other R. solanacearum strains. Furthermore, the genetic distances between strains, assessed by an in-silico equivalent of DNA-DNA hybridization, unambiguously showed that phylotype IV strains of BDB, R. syzygii and R. solanacearum form one genomic species. Based on these comprehensive data we propose a revision of the taxonomy of the R. solanacearum species complex. The BDB and R. syzygii genomes encoded no obvious unique metabolic capacities and contained no evidence of horizontal gene transfer from bacteria occupying similar niches. Genes specific to R. syzygii and BDB were almost all of unknown function or extrachromosomal origin. Thus, the pathogenic life-styles of these organisms are more probably due to ecological adaptation and genomic convergence during vertical evolution than to the acquisition of DNA by horizontal transfer

Public Library of Science (PLOS)

HAL Evry

Crossref

Directory of Open Access Journals

The fitness cost of mis-splicing is the main determinant of alternative splicing patterns

Author: A Reyes
AK Ramani
Alexandra Popa
Anamaria Necsulea
Baptiste Saudemont
BJ Blencowe
BR Graveley
C Trapnell
Corinne Blugeon
CR Edwards
E Dubois
E Kim
E Melamud
Eric Meyer
F Abascal
FM Hamid
G Drechsel
I Ezkurdia
J Beisson
J Merkin
J Weischenfeldt
J-M Aury
JE Smith
JJ-L Wong
JJL Wong
JK Pickrell
JM Mudge
Joanna L. Parmley
JZ Ni
L Duret
Laurent Duret
LF Lareau
LF Lareau
M Bulmer
M Graille
M Irimia
M Kalyna
M Wang
ML Tress
ML Tress
MW-L Popp
N Stepankiw
NJ McGlincy
NL Barbosa-Morais
O Garnier
O Jaillon
O Kelemen
PL Boutz
RGH Lindeboom
The 1000 Genomes Project Consortium
TW Nilsen
U Braunschweig
Vincent Rocher
W Sung
W Sung
Y Ge
Y Marquez
Z Wang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2017
Field of study

Background Most eukaryotic genes are subject to alternative splicing (AS), which may contribute to the production of protein variants or to the regulation of gene expression via nonsense-mediated messenger RNA (mRNA) decay (NMD). However, a fraction of splice variants might correspond to spurious transcripts and the question of the relative proportion of splicing errors to functional splice variants remains highly debated. Results We propose a test to quantify the fraction of AS events corresponding to errors. This test is based on the fact that the fitness cost of splicing errors increases with the number of introns in a gene and with expression level. We analyzed the transcriptome of the intron-rich eukaryote Paramecium tetraurelia. We show that in both normal and in NMD-deficient cells, AS rates strongly decrease with increasing expression level and with increasing number of introns. This relationship is observed for AS events that are detectable by NMD as well as for those that are not, which invalidates the hypothesis of a link with the regulation of gene expression. Our results show that in genes with a median expression level, 92–98% of observed splice variants correspond to errors. We observed the same patterns in human transcriptomes and we further show that AS rates correlate with the fitness cost of splicing errors. Conclusions These observations indicate that genes under weaker selective pressure accumulate more maladaptive substitutions and are more prone to splicing errors. Thus, to a large extent, patterns of gene expression variants simply reflect the balance between selection, mutation, and drift

Central Archive at the University of Reading

Crossref

ZENODO

Directory of Open Access Journals

HAL-Inserm

INRIA a CCSD electronic archive server

HAL Descartes

Distinct Gene Number-Genome Size Relationships for Eukaryotes and Non-Eukaryotes: Gene Content Estimation for Dinoflagellate Genomes

Author: AC Ivens
AG Hinnebusch
AR Loeblich III
CH Slamovits
CH Slamovits
D Lee
DC Sigee
DL Spector
DM Anderson
DW Coats
FM Van Dolah
H Moreau
H Zhang
H Zhang
H Zhang
J Archibald
J Lukes
J Ramsey
J Reichman
JD Hackett
JD Hackett
JM Aury
JR Allen
KH Wolfe
KT Konstantinidis
L Pfiester
L Xu
LY Liu
M Berriman
M Lynch
M Lynch
M McEwan
MJW Veldhuis
NJ Patron
NJ Patron
O Holm-Hansen
P Salois
PJ Rizzo
PJ Rizzo
QH Le
RE Steel
RJ Blank
Rosemary Jeanne Redfield
S Lin
S Lin
Senjie Lin
SR Santos
T Bertomeu
TC LaJeunesse
TM Roberts
TR Bachvaroff
TR Gregory
TR Gregory
TR Gregory
Y Bhaud
Y Bouligand
YH Chan
Yubo Hou
Publication venue: Public Library of Science
Publication date: 01/09/2009
Field of study

The ability to predict gene content is highly desirable for characterization of not-yet sequenced genomes like those of dinoflagellates. Using data from completely sequenced and annotated genomes from phylogenetically diverse lineages, we investigated the relationship between gene content and genome size using regression analyses. Distinct relationships between log10-transformed protein-coding gene number (Y′) versus log10-transformed genome size (X′, genome size in kbp) were found for eukaryotes and non-eukaryotes. Eukaryotes best fit a logarithmic model, Y′ = ln(-46.200+22.678X′, whereas non-eukaryotes a linear model, Y′ = 0.045+0.977X′, both with high significance (p<0.001, R2>0.91). Total gene number shows similar trends in both groups to their respective protein coding regressions. The distinct correlations reflect lower and decreasing gene-coding percentages as genome size increases in eukaryotes (82%–1%) compared to higher and relatively stable percentages in prokaryotes and viruses (97%–47%). The eukaryotic regression models project that the smallest dinoflagellate genome (3×106 kbp) contains 38,188 protein-coding (40,086 total) genes and the largest (245×106 kbp) 87,688 protein-coding (92,013 total) genes, corresponding to 1.8% and 0.05% gene-coding percentages. These estimates do not likely represent extraordinarily high functional diversity of the encoded proteome but rather highly redundant genomes as evidenced by high gene copy numbers documented for various dinoflagellate species

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Absence of Positive Selection on Centromeric Histones in Tetrahymena Suggests Unsuppressed Centromere-Drive in Lineages Lacking Male Meiosis

Centromere-drive is a process where centromeres compete for transmission through asymmetric "female" meiosis for inclusion into the oocyte. In symmetric "male" meiosis, all meiotic products form viable germ cells. Therefore, the primary incentive for centromere-drive, a potential transmission bias, is believed to be missing from male meiosis. In this article, we consider whether male meiosis also bears the primary cost of centromere-drive. Because different taxa carry out different combinations of meiotic programs (symmetric + asymmetric, symmetric only, asymmetric only), it is possible to consider the evolutionary consequences of centromere-drive in the context of these differing systems. Groups with both types of meiosis have large, rapidly evolving centromeric regions, and their centromeric histones (CenH3s) have been shown to evolve under positive selection, suggesting roles as suppressors of centromere-drive. In contrast, taxa with only symmetric male meiosis have shown no evidence of positive selection in their centromeric histones. In this article, we present the first evolutionary analysis of centromeric histones in ciliated protozoans, a group that only undergoes asymmetric "female" meiosis. We find no evidence of positive selection acting on CNA1, the CenH3 of Tetrahymena species. Cytological observations of a panel of Tetrahymena species are consistent with dynamic karyotype evolution in this lineage. Our findings suggest that defects in male meiosis, and not mitosis or female meiosis, are the primary selective force behind centromere-drive suppression. Our study raises the possibility that taxa like ciliates, with only female meiosis, may therefore undergo unsuppressed centromere drive

Crossref

Springer - Publisher Connector

PubMed Central