Springer - Publisher Connector

INRIA a CCSD electronic archive server

Safe and complete contig assembly via omnitigs

Author: A Bankevich
A Guénoche
AR Rubinov
AS Motahari
C Kingsford
D Haussler
DR Zerbino
E Kapun
E Kapun
ES Lander
G Bresler
G Narzisi
I Lysov
JD Kececioglu
JR Miller
JT Simpson
JT Simpson
K Lam
K Sahlin
L Salmela
M Boetzer
M Boetzer
N Nagarajan
N Nagarajan
N Vyahhi
P Medvedev
P Medvedev
P Medvedev
PA Pevzner
PA Pevzner
R Chikhi
R Chikhi
R Luo
R Uricaru
RM Idury
SL Salzberg
Publication date: 16/08/2016

Contig assembly is the first stage that most assemblers solve when reconstructing a genome from a set of reads. Its output consists of contigs -- a set of strings that are promised to appear in any genome that could have generated the reads. From the introduction of contigs 20 years ago, assemblers have tried to obtain longer and longer contigs, but the following question was never solved: given a genome graph G (e.g. a de Bruijn, or a string graph), what are all the strings that can be safely reported from G as contigs? In this paper we finally answer this question, and also give a polynomial time algorithm to find them. Our experiments show that these strings, which we call omnitigs, are 66% to 82% longer on average than the popular unitigs, and 29% of dbSNP locations have more neighbors in omnitigs than in unitigs. Comment: Full version of the paper in the proceedings of RECOMB 201
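
For readers unfamiliar with the unitigs that the abstract uses as its baseline, the following is a minimal sketch of how unitigs can be read off an edge-centric de Bruijn graph. It is an illustration under stated assumptions, not the paper's omnitig algorithm: the function names `build_de_bruijn` and `unitigs` are hypothetical, reads are assumed error-free and single-stranded (no reverse complements), and isolated cycles are ignored.

```python
from collections import defaultdict


def build_de_bruijn(reads, k):
    """Edge-centric de Bruijn graph: each distinct k-mer is an edge
    from its (k-1)-mer prefix to its (k-1)-mer suffix."""
    succ = defaultdict(set)
    pred = defaultdict(set)
    for read in reads:
        for i in range(len(read) - k + 1):
            kmer = read[i:i + k]
            u, v = kmer[:-1], kmer[1:]
            succ[u].add(v)
            pred[v].add(u)
    return succ, pred


def unitigs(reads, k):
    """Return the strings spelled by maximal non-branching paths.

    Assumption: cycles made only of non-branching nodes are skipped,
    because no path is started from inside them.
    """
    succ, pred = build_de_bruijn(reads, k)
    nodes = set(succ) | set(pred)

    def is_internal(n):
        # A node can sit in the middle of a unitig only if it has
        # exactly one incoming and one outgoing edge.
        return len(pred.get(n, ())) == 1 and len(succ.get(n, ())) == 1

    result = []
    for u in sorted(nodes):
        if is_internal(u):
            continue
        # Start one path per outgoing edge of every branching or end node.
        for v in sorted(succ.get(u, ())):
            path = u + v[-1]
            while is_internal(v):
                (v,) = succ[v]
                path += v[-1]
            result.append(path)
    return result


print(unitigs(["ATGGCGT", "GGCGA"], 3))  # ['ATGGCG', 'CGA', 'CGT']
```

In this toy input the graph branches at node `CG`, so the unitig `ATGGCG` stops there; omnitigs, as described in the abstract, are the strings that can safely be extended beyond such limits.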

arXiv.org e-Print Archive

Access to Research and Communications Annals

The scaling of genetic diversity in a changing and fragmented world

Author: Arenas M.
Chikhi L.
Currat M.
Excoffier L.
Mona S.
Rasteiro R.
Ray N.
Schmeller D.S.
Sramkova Hanulova A.
Trochet A.
Publication venue: 'Pensoft Publishers'
Publication date: 01/01/2013

Most species do not live in a constant environment over space or time. Their environment is often heterogeneous, with a huge variability in resource availability and exposure to pathogens or predators, which may affect the local densities of the species. Moreover, the habitat might be fragmented, preventing free and isotropic migrations between local sub-populations (demes) of a species and making some demes more isolated than others. For example, during the last ice age, populations of many species migrated towards refuge areas from which re-colonization originated when conditions improved. However, populations that could not move fast enough or could not adapt to the new environmental conditions faced extinction. Populations living in these types of dynamic environments are often referred to as metapopulations and modeled as an array of subdivisions (or demes) that exchange migrants with their neighbors. Several studies have focused on the description of their demography, probability of extinction and expected patterns of diversity at different scales. Importantly, all these evolutionary processes may affect genetic diversity, which can affect the chance of populations to persist. In this chapter we provide an overview of the consequences of fragmentation, long-distance dispersal, range contractions and range shifts on genetic diversity. In addition, we describe new methods to detect and quantify underlying evolutionary processes from sampled genetic data. Laboratoire d’Excellence (LABEX) entitled TULIP: (ANR-10-LABX-41)

Habitat fragmentation and genetic diversity in natural populations of the Bornean elephant: Implications for conservation

Author: Ambu Laurentius N.
Ancrenaz Marc
Bruford Michael William
Chikhi Lounès
Goossens Benoit
Jue Nathaniel K.
Kun-Rodrigues Célia
O'Neill Rachel J.
Othman Nurzhafarina
Sakong Rosdi
Sharma Reeta
Publication venue: 'Elsevier BV'
Publication date: 23/02/2016

Online Research @ Cardiff

Improved control strategy of DFIG-based wind turbines using direct torque and direct power control techniques

This paper presents different control strategies for a variable-speed wind energy conversion system (WECS), based on a doubly fed induction generator. Direct Torque Control (DTC) with Space-Vector Modulation is used on the rotor side converter. This control method is known to reduce the fluctuations of the torque and flux at low speeds in contrast to the classical DTC, where the frequency of switching is uncontrollable. The reference for torque is obtained from the maximum power point tracking technique of the wind turbine. For the grid-side converter, a fuzzy direct power control is proposed for the control of the instantaneous active and reactive power. Simulation results of the WECS are presented to compare the performance of the proposed and classical control approaches. Peer reviewed. Final Accepted Version

University of Hertfordshire Research Archive

A Genealogical Interpretation of Principal Components Analysis

Author: AG Fix
AL Price
D Reich
G Barbujani
GA McVean
Gil McVean
HM Wilkinson-Herbots
J Baik
J Novembre
J Novembre
L Chikhi
LL Cavalli-Sforza
M Currat
M Slatkin
Molly Przeworski
N Patterson
P Debashis
S Klopfstein
S Schaffner
Publication venue: Public Library of Science
Publication date: 01/01/2009

Principal components analysis (PCA) is a statistical method commonly used in population genetics to identify structure in the distribution of genetic variation across geographical location and ethnic background. However, while the method is often used to inform about historical demographic processes, little is known about the relationship between fundamental demographic parameters and the projection of samples onto the primary axes. Here I show that for SNP data the projection of samples onto the principal components can be obtained directly from considering the average coalescent times between pairs of haploid genomes. The result provides a framework for interpreting PCA projections in terms of underlying processes, including migration, geographical isolation, and admixture. I also demonstrate a link between PCA and Wright's FST and show that SNP ascertainment has a largely simple and predictable effect on the projection of samples. Using examples from human genetics, I discuss the application of these results to empirical data and the implications for inference.
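
To make "the projection of samples onto the principal components" concrete, here is a minimal sketch of ordinary PCA on a small haploid SNP matrix. It is not the coalescent-time derivation described in the abstract; it only shows the quantity being interpreted. The function name `first_pc` is hypothetical, the toy genotypes are invented for illustration, and the leading component is found by power iteration on the sample-by-sample Gram matrix, assuming the data are not all identical.

```python
import math


def first_pc(genotypes, iters=200):
    """Project samples (rows) onto the first principal component.

    genotypes: list of equal-length lists of 0/1 alleles, one row per
    haploid genome. Assumes at least one SNP varies between samples.
    """
    n = len(genotypes)
    m = len(genotypes[0])
    # Centre each SNP (column) on its sample mean.
    means = [sum(row[j] for row in genotypes) / n for j in range(m)]
    X = [[row[j] - means[j] for j in range(m)] for row in genotypes]
    # Gram matrix: inner products between centred samples.
    G = [[sum(a * b for a, b in zip(X[i], X[j])) for j in range(n)]
         for i in range(n)]
    # Power iteration from an arbitrary non-constant start vector.
    v = [float(i + 1) for i in range(n)]
    for _ in range(iters):
        w = [sum(G[i][j] * v[j] for j in range(n)) for i in range(n)]
        norm = math.sqrt(sum(x * x for x in w))
        v = [x / norm for x in w]
    # Rayleigh quotient gives the leading eigenvalue of G.
    eigval = sum(v[i] * sum(G[i][j] * v[j] for j in range(n))
                 for i in range(n))
    # Sample coordinates on PC1 are the eigenvector scaled by sqrt(eigenvalue).
    return [x * math.sqrt(eigval) for x in v]


# Two toy "populations" of two haploid genomes each (invented data).
toy = [
    [1, 1, 1, 0, 0, 0],
    [1, 1, 0, 0, 0, 0],
    [0, 0, 0, 1, 1, 1],
    [0, 0, 1, 1, 1, 1],
]
pc1 = first_pc(toy)
print(pc1)
```

On this toy matrix the first two samples receive coordinates of one sign and the last two the opposite sign, which is the kind of separation along the primary axis that the abstract sets out to interpret.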

Public Library of Science (PLOS)

Oxford University Research Archive

Wave-of-Advance Models of the Diffusion of the Y Chromosome Haplogroup R1b1b2 in Europe

Author: A Achilli
A Torroni
A Whittle
AJ Ammerman
AJ Ammerman
B Bramanti
Carles Lalueza-Fox
F Cruciani
F Di Giacomo
G Barbujani
G Barker
H Liu
I Dupanloup
IJ Wilson
J Chiaroni
J Diamond
L Chikhi
L Chikhi
L Morelli
L Simoni
LA Zhivotovsky
LA Zhivotovsky
LL Cavalli-Sforza
M Currat
M Richards
M Richards
MA Jobling
N Ray
NM Myres
O Francois
O François
O Semino
Olivier François
P Balaresque
P Menozzi
P Soares
Per Sjödin
TR Martin
W Haak
W Haak
W Shi
Publication venue: Public Library of Science
Publication date: 01/01/2011

Whether or not the spread of agriculture in Europe was accompanied by movements of people is a long-standing question in archeology and anthropology, which has been frequently addressed with the help of population genetic data. Estimates of dates of expansion and geographic origins obtained from genetic data are, however, sensitive to the calibration of mutation rates and to the mathematical models used to perform inference. For instance, recent data on the Y chromosome haplogroup R1b1b2 (M269) have suggested either a Neolithic origin for European paternal lineages or a more ancient Paleolithic origin, depending on the calibration of Y-STR mutation rates. Here we examine the date of expansion and the geographic origin of hgR1b1b2 considering two current estimates of mutation rates in a total of fourteen realistic wave-of-advance models. We report that a range expansion dating to the Paleolithic is unlikely to explain the observed geographical distribution of microsatellite diversity, and that whether the data are informative with respect to the spread of agriculture in Europe depends critically on the mutation rate assumption.

CiteSeerX

Public Library of Science (PLOS)

Hal - Université Grenoble Alpes


Craniometric Data Supports Demic Diffusion Model for the Spread of Agriculture into Europe

Author: A Manica
A Whittle
AB Falsetti
AJ Ammerman
AJ Ammerman
AJ Ammerman
AK Scherer
C Perlès
CC Roseman
CC Roseman
DM Waddle
E Peltenburg
E Willerslev
EM Belle
G Barbujani
G Barbujani
H Harpending
HF Smith
I Dupanloup
I Kuijt
J Binladen
J Novembre
JH Relethford
JH Relethford
JH Relethford
John Relethford
K Harvati
K Harvati
L Betti
L Chikhi
L Chikhi
L Chikhi
L Thissen
LL Cavalli-Sforza
LW Konigsberg
LW Konigsberg
LW Konigsberg
M Currat
M Richards
M Richards
M Richards
M Zvelebil
M Zvelebil
M Özdoğan
M Özdoğan
MB Richards
ML Sampietro
MM Dow
N von Cramon-Taubadel
N von Cramon-Taubadel
N von Cramon-Taubadel
NA Mantel
Noreen von Cramon-Taubadel
O Semino
P Menozzi
PE Smouse
PE Smouse
R Dennell
R Dennell
R Gonzalez-Jose
R Martin
R Pinhasi
R Pinhasi
R Pinhasi
Ron Pinhasi
RR Sokal
RW Sinnott
S Pääbo
S Ramachandran
S Wright
S Wright
W Haak
WL Jungers
WW Howells
ZH Rosser
Publication venue: Public Library of Science
Publication date: 01/08/2009

BACKGROUND: The spread of agriculture into Europe and the ancestry of the first European farmers have been subjects of debate and controversy among geneticists, archaeologists, linguists and anthropologists. Debates have centred on the extent to which the transition was associated with the active migration of people as opposed to the diffusion of cultural practices. Recent studies have shown that patterns of human cranial shape variation can be employed as a reliable proxy for the neutral genetic relationships of human populations. METHODOLOGY/PRINCIPAL FINDINGS: Here, we employ measurements of Mesolithic (hunter-gatherers) and Neolithic (farmers) crania from Southwest Asia and Europe to test several alternative population dispersal and hunter-farmer gene-flow models. We base our alternative hypothetical models on a null evolutionary model of isolation-by-geographic and temporal distance. Partial Mantel tests were used to assess the congruence between craniometric distance and each of the geographic model matrices, while controlling for temporal distance. Our results demonstrate that the craniometric data fit a model of continuous dispersal of people (and their genes) from Southwest Asia to Europe significantly better than a null model of cultural diffusion. CONCLUSIONS/SIGNIFICANCE: Therefore, this study does not support the assertion that farming in Europe solely involved the adoption of technologies and ideas from Southwest Asia by indigenous Mesolithic hunter-gatherers. Moreover, the results highlight the utility of craniometric data for assessing patterns of past population dispersal and gene flow.

Composite likelihood estimation of demographic parameters

Author: A Gelman
A Keinan
AM Adams
B Charlesworth
BF Voight
BG Lindsay
C Becquet
C Wiuf
CJ Gilks
Daniel Garrigan
G McVean
H Li
J Wakeley
JC Fay
JD Wall
JD Wall
JE Pool
JP Hulsenbeck
LM Chikhi
MA Beaumont
MF Hammer
N Metropolis
N Takezaki
OE Gaggiotti
P Fearnhead
PW Hedrick
R Nielsen
RD Hernandez
RR Hudson
WK Hastings
WR Gilks
Y Kim
Publication venue: BioMed Central
Publication date: 01/01/2009

Background: Most existing likelihood-based methods for fitting historical demographic models to DNA sequence polymorphism data do not scale feasibly up to the level of whole-genome data sets. Computational economies can be achieved by incorporating two forms of pseudo-likelihood: composite and approximate likelihood methods. Composite likelihood enables scaling up to large data sets because it takes the product of marginal likelihoods as an estimator of the likelihood of the complete data set. This approach is especially useful when a large number of genomic regions constitutes the data set. Additionally, approximate likelihood methods can reduce the dimensionality of the data by summarizing the information in the original data by either a sufficient statistic or a set of statistics. Both composite and approximate likelihood methods hold promise for analyzing large data sets or for use in situations where the underlying demographic model is complex and has many parameters. This paper considers a simple demographic model of allopatric divergence between two populations, in which one of the populations is hypothesized to have experienced a founder event, or population bottleneck. A large resequencing data set from human populations is summarized by the joint frequency spectrum, which is a matrix of the genomic frequency spectrum of derived base frequencies in two populations. A Bayesia

CiteSeerX

Springer - Publisher Connector