1,018 research outputs found

Inference of single-cell phylogenies from lineage tracing data using Cassiopeia.

Author: Chan Michelle M
Hussmann Jeffrey A
Jones Matthew G
Khodaverdian Alex
Quinn Jeffrey J
Wang Robert
Weissman Jonathan S
Xu Chenling
Yosef Nir
Publication venue: eScholarship, University of California
Publication date: 01/04/2020

The pairing of CRISPR/Cas9-based gene editing with massively parallel single-cell readouts now enables large-scale lineage tracing. However, the rapid growth in complexity of data from these assays has outpaced our ability to accurately infer phylogenetic relationships. First, we introduce Cassiopeia, a suite of scalable maximum parsimony approaches for tree reconstruction. Second, we provide a simulation framework for evaluating algorithms and exploring lineage tracer design principles. Finally, we generate the most complex experimental lineage tracing dataset to date, 34,557 human cells continuously traced over 15 generations, and use it for benchmarking phylogenetic inference approaches. We show that Cassiopeia outperforms traditional methods by several metrics and under a wide variety of parameter regimes, and provide insight into the principles for the design of improved Cas9-enabled recorders. Together, these should broadly enable large-scale mammalian lineage tracing efforts. Cassiopeia and its benchmarking resources are publicly available at www.github.com/YosefLab/Cassiopeia

eScholarship - University of California

Error, bias, and long-branch attraction in data for two chloroplast photosystem genes in seed plants

Author: Brady Siobhan
Hu J.M.
Sanderson Michael
Scharaschkin Tanya
Wojciechowski Martin
Publication venue: Oxford University Press (OUP)
Publication date: 01/01/2000

Sequences of two chloroplast photosystem genes, psaA and psbB, together comprising about 3,500 bp, were obtained for all five major groups of extant seed plants and several outgroups among other vascular plants. Strongly supported, but significantly conflicting, phylogenetic signals were obtained in parsimony analyses from partitions of the data into first and second codon positions versus third positions. In the former, both genes agreed on a monophyletic gymnosperms, with Gnetales closely related to certain conifers. In the latter, Gnetales are inferred to be the sister group of all other seed plants, with gymnosperms paraphyletic. None of the data supported the modern "anthophyte hypothesis," which places Gnetales as the sister group of flowering plants. A series of simulation studies was undertaken to examine the error rate for parsimony inference. Three kinds of errors were examined: random error, systematic bias (both properties of finite data sets), and statistical inconsistency owing to long-branch attraction (an asymptotic property). Parsimony reconstructions were extremely biased for third-position data for psbB. Regardless of the true underlying tree, a tree in which Gnetales are sister to all other seed plants was likely to be reconstructed for these data. None of the combinations of genes or partitions permits the anthophyte tree to be reconstructed with high probability. Simulations of progressively larger data sets indicate the existence of long-branch attraction (statistical inconsistency) for third-position psbB data if either the anthophyte tree or the gymnosperm tree is correct. This is also true for the anthophyte tree using either psaA third positions or psbB first and second positions. A factor contributing to bias and inconsistency is extremely short branches at the base of the seed plant radiation, coupled with extremely high rates in Gnetales and nonseed plant outgroups.

Queensland University of Technology ePrints Archive

Multivariate Approaches to Classification in Extragalactic Astronomy

Author: Chattopadhyay Asis Kumar
Fraix-Burnet Didier
Thuillard Marc
Publication venue: Frontiers Media SA
Publication date: 01/01/2015

Clustering objects into synthetic groups is a natural activity of any science. Astrophysics is no exception and is now facing a deluge of data. For galaxies, the century-old Hubble classification and the Hubble tuning fork are still largely in use, together with numerous mono- or bivariate classifications most often made by eye. However, a classification must be driven by the data, and sophisticated multivariate statistical tools are used more and more often. In this paper we review these different approaches in order to situate them in the general context of unsupervised and supervised learning. We emphasize the astrophysical outcomes of these studies to show that multivariate analyses provide an obvious path toward a renewal of our classification of galaxies and are invaluable tools to investigate the physics and evolution of galaxies. Comment: Open Access paper. http://www.frontiersin.org/milky_way_and_galaxies/10.3389/fspas.2015.00003/abstract. 10.3389/fspas.2015.00003

arXiv.org e-Print Archive

Hal - Université Grenoble Alpes

Frontiers - Publisher Connector

HAL Descartes

HAL-INSU

HAL Université de Savoie

Bayesian Phylogeography Finds Its Roots

Author: A Chalmers
A Drummond
A Rambaut
A Templeton
AJ Drummond
Alexei J. Drummond
AM Kilpatrick
Andrew Rambaut
C Cunningham
C Talbi
C Viboud
CA Russell
Christophe Fraser
D Vijaykrishna
DL Knobel
E Holmes
E Ranta
F Ronquist
H Bourhy
H Chen
H Chipman
I Sanmartin
J Felsenstein
J Kingman
J Parker
J Wang
K Hampson
K Subbarao
L Knowles
L Kuo
M Hasegawa
M Pagel
M Slatkin
Marc A. Suchard
O Bjornstad
O Pybus
P Liò
Philippe Lemey
PL Davis
R Anderson
R Lanciotti
R Wallace
RG Wallace
S Kullback
S Zarate
SJ Olsen
SLK Pond
T Jukes
TH Wang
V Minin
WM D Swofford
WM Fitch
X Xu
Z Gilulua
Z Yang
Publication venue: Public Library of Science
Publication date: 01/09/2009

As a key factor in endemic and epidemic dynamics, the geographical distribution of viruses has been frequently interpreted in the light of their genetic histories. Unfortunately, inference of historical dispersal or migration patterns of viruses has mainly been restricted to model-free heuristic approaches that provide little insight into the temporal setting of the spatial dynamics. The introduction of probabilistic models of evolution, however, offers unique opportunities to engage in this statistical endeavor. Here we introduce a Bayesian framework for inference, visualization and hypothesis testing of phylogeographic history. By implementing character mapping in Bayesian software that samples time-scaled phylogenies, we enable the reconstruction of timed viral dispersal patterns while accommodating phylogenetic uncertainty. Standard Markov model inference is extended with a stochastic search variable selection procedure that identifies the parsimonious descriptions of the diffusion process. In addition, we propose priors that can incorporate geographical sampling distributions or characterize alternative hypotheses about the spatial dynamics. To visualize the spatial and temporal information, we summarize inferences using virtual globe software. We describe how Bayesian phylogeography compares with previous parsimony analysis in the investigation of the influenza A H5N1 origin and H5N1 epidemiological linkage among sampling localities. Analysis of rabies in West African dog populations reveals how virus diffusion may enable endemic maintenance through continuous epidemic cycles. From these analyses, we conclude that our phylogeographic framework will make an important asset in molecular epidemiology that can be easily generalized to infer biogeography from genetic data for many organisms.

Lirias

Crossref

Directory of Open Access Journals

PubMed Central

Edinburgh Research Explorer

eScholarship - University of California

The compositional and evolutionary logic of metabolism

Author: Alberts B
Andreini C
Balche W E
Barker H A
Benner S A
Berge C
Braakman R
Brazelton W J
Buss L W
Collins M D
Copley S D
Dagley S
Darwin C
de Duve C
Doyle J
Eric Smith
Erwin D H
Feist A M
Fenchel T
Fisher R A
Fry I
Gerhart J
Gesteland R F
Gould S J
Gray H B
Guiral M
Haldane J B S
Kallen R G
Kauffman S
Kim J
Lengeler J W
Ljungdahl L
MacKenzie R E
Martin W
Massey L K
Metzler D E
Morowitz H J
Nelson D L
Nitscke W
Odling-Smee F J
Oparin A I
Petsko G A
Puigbo P
Ragsdale S
Ragsdale S W
Rankama K
Razin S
Redfield A C
Rogier Braakman
Russell M J
Schrödinger E
Schuster P
Segré D
Shapiro R
Simon H A
Smith E
Srinivasan V
Sterner R W
Stryer L
Szent-Gyorgyi A
Tazuya K
Utter M F
Vogels G D
von Wettstein D
Vorholt J A
Waber L J
Wald G
Wilson E B
Wächtershäuser G
Yarus M
Yi L
Publication venue: IOP Publishing
Publication date: 30/10/2012

Metabolism displays striking and robust regularities in the forms of modularity and hierarchy, whose composition may be compactly described. This renders metabolic architecture comprehensible as a system, and suggests the order in which layers of that system emerged. Metabolism also serves as the foundation in other hierarchies, at least up to cellular integration including bioenergetics and molecular replication, and trophic ecology. The recapitulation of patterns first seen in metabolism, in these higher levels, suggests metabolism as a source of causation or constraint on many forms of organization in the biosphere. We identify as modules widely reused subsets of chemicals, reactions, or functions, each with a conserved internal structure. At the small molecule substrate level, module boundaries are generally associated with the most complex reaction mechanisms and the most conserved enzymes. Cofactors form a structurally and functionally distinctive control layer over the small-molecule substrate. Complex cofactors are often used at module boundaries of the substrate level, while simpler ones participate in widely used reactions. Cofactor functions thus act as "keys" that incorporate classes of organic reactions within biochemistry. The same modules that organize the compositional diversity of metabolism are argued to have governed long-term evolution. Early evolution of core metabolism, especially carbon-fixation, appears to have required few innovations among a small number of conserved modules, to produce adaptations to simple biogeochemical changes of environment. We demonstrate these features of metabolism at several levels of hierarchy, beginning with the small-molecule substrate and network architecture, continuing with cofactors and key conserved reactions, and culminating in the aggregation of multiple diverse physical and biochemical processes in cells. Comment: 56 pages, 28 figures

arXiv.org e-Print Archive

Crossref

Parsimony and likelihood reconstruction of human segmental duplications

Author: Alkan
Bailey
Benjamin J. Raphael
Blekhman
Borislav H. Hristov
Chaudhuri
Crystal L. Kahn
El-Mabrouk
Ergun
Jiang
Kahn
Lajoie
Marron
McCaskill
Price
Sankoff
Publication venue: Oxford University Press
Publication date

Motivation: Segmental duplications > 1 kb in length with ≥ 90% sequence identity between copies comprise nearly 5% of the human genome. They are frequently found in large, contiguous regions known as duplication blocks that can contain mosaic patterns of thousands of segmental duplications. Reconstructing the evolutionary history of these complex genomic regions is a non-trivial, but important task

Crossref

PubMed Central