Search CORE

7 research outputs found

Fractionation statistics

Author: Sankoff David
Wang Baoyong
Zheng Chunfang
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background Paralog reduction, the loss of duplicate genes after whole genome duplication (WGD) is a pervasive process. Whether this loss proceeds gene by gene or through deletion of multi-gene DNA segments is controversial, as is the question of fractionation bias, namely whether one homeologous chromosome is more vulnerable to gene deletion than the other. Results As a null hypothesis, we first assume deletion events, on one homeolog only, excise a geometrically distributed number of genes with unknown mean <it>µ</it>, and these events combine to produce deleted runs of length l, distributed approximately as a negative binomial with unknown parameter <it>r</it>, itself a random variable with distribution <it>π</it>(·). A more realistic model requires deletion events on both homeologs distributed as a truncated geometric. We simulate the distribution of run lengths <it>l</it> in both models, as well as the underlying <it>π</it>(<it>r</it>), as a function of <it>µ</it>, and show how sampling <it>l</it> allows us to estimate <it>µ</it>. We apply this to data on a total of 15 genomes descended from 6 distinct WGD events and show how to correct the bias towards shorter runs caused by genome rearrangements. Because of the difficulty in deriving <it>π</it>(·) analytically, we develop a deterministic recurrence to calculate each <it>π</it>(<it>r</it>) as a function of <it>µ</it> and the proportion of unreduced paralog pairs. Conclusions The parameter <it>µ</it> can be estimated based on run lengths of single-copy regions. Estimates of <it>µ</it> in real data do not exclude the possibility that duplicate gene deletion is largely gene by gene, although it may sometimes involve longer segments.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

A continuous analog of run length distributions reflecting accumulated fractionation events

Author: B Wang
C Zheng
D Sankoff
D Sankoff
D Sankoff
David Sankoff
JK Byrnes
MJ van Hoek
Zhe Yu
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Gene order in rosid phylogeny, inferred from pairwise syntenies among extant genomes

Author: Chunfang Zheng
David Sankoff
Publication venue: Springer Nature
Publication date: 25/06/2012
Field of study

BACKGROUND: Ancestral gene order reconstruction for flowering plants has lagged behind developments in yeasts, insects and higher animals, because of the recency of widespread plant genome sequencing, sequencers' embargoes on public data use, paralogies due to whole genome duplication (WGD) and fractionation of undeleted duplicates, extensive paralogy from other sources, and the computational cost of existing methods. RESULTS: We address these problems, using the gene order of four core eudicot genomes (cacao, castor bean, papaya and grapevine) that have escaped any recent WGD events, and two others (poplar and cucumber) that descend from independent WGDs, in inferring the ancestral gene order of the rosid clade and those of its main subgroups, the fabids and malvids. We improve and adapt techniques including the OMG method for extracting large, paralogy-free, multiple orthologies from conflated pairwise synteny data among the six genomes and the PATHGROUPS approach for ancestral gene order reconstruction in a given phylogeny, where some genomes may be descendants of WGD events. We use the gene order evidence to evaluate the hypothesis that the order Malpighiales belongs to the malvids rather than as traditionally assigned to the fabids. CONCLUSIONS: Gene orders of ancestral eudicot species, involving 10,000 or more genes can be reconstructed in an efficient, parsimonious and consistent way, despite paralogies due to WGD and other processes. Pairwise genomic syntenies provide appropriate input to a parameter-free procedure of multiple ortholog identification followed by gene-order reconstruction in solving instances of the "small phylogeny" problem

Springer - Publisher Connector

PubMed Central

Gene order in rosid phylogeny, inferred from pairwise syntenies among extant genomes

Author: A Muñnoz
A Ouangraoua
AP Chan
B Thomas
B Wang
C Zheng
C Zheng
C Zheng
Chunfang Zheng
D Bertrand
D Sankoff
D Sankoff
D Sankoff
David Sankoff
DE Soltis
E Lyons
E Lyons
E Tannier
F Forest
G Moore
GA Tuskan
H Tang
J Ma
JG Burleigh
JL Gordon
N El-Mabrouk
O Jaillon
R Ming
R Velasco
RJ Langham
S Huang
S Warshall
S Yancopoulos
V Shulaev
WJ Murphy
X Argout
Z Adam
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Following Tetraploidy in Maize, a Short Deletion Mechanism Removed Genes Preferentially from One of the Two Homeologs

Author: A. H Paterson
B. C Thomas
B. J Haas
B. S Gaut
Brent S. Pedersen
C Simillion
D Lisch
D. A Petrov
D. R Schrider
Damon Lisch
E Lyons
E Lyons
E. R Liman
Eric Lyons
H Tang
J Lai
J. G Walling
J. L Pasieka
James C. Schnable
K. M Devos
Kenneth H. Wolfe
M Freeling
M Freeling
M Freeling
M Kasahara
M Lynch
M Lynch
M Lynch
Margaret R. Woodhouse
Michael Freeling
P SanMiguel
P. A Ziolkowski
P. S Schnable
R Fischer
R. J Langham
S Ahn
S Henikoff
S. F Altschul
Sb Needlema
Shabarinath Subramaniam
X Wang
Z Swigonova
Z. H Yang
Publication venue: Public Library of Science
Publication date: 29/06/2010
Field of study

Following genome duplication and selfish DNA expansion, maize used a heretofore unknown mechanism to shed redundant genes and functionless DNA with bias toward one of the parental genomes

Public Library of Science (PLOS)

Crossref

DigitalCommons@University of Nebraska

Directory of Open Access Journals

PubMed Central