    Quantifying the Length and Variance of the Eukaryotic Cell Cycle Phases by a Stochastic Model and Dual Nucleoside Pulse Labelling

    Fundamental properties of cell populations are their growth rate, the time needed for cell division, and the variance of that time. The eukaryotic cell cycle progresses in an ordered sequence through the phases G1, S, G2, and M, and is regulated by environmental cues and by intracellular checkpoints. Reflecting this regulatory complexity, the length of each phase varies considerably between different kinds of cells but also among genetically and morphologically indistinguishable cells. This article addresses the question of how to describe and quantify the mean and variance of the cell cycle phase lengths. A phase-resolved cell cycle model is introduced assuming that phase completion times are distributed as delayed exponential functions, capturing the observations that each realization of a cycle phase is variable in length and requires a minimal time. In this model, the total cell cycle length is distributed as a delayed hypoexponential function that closely reproduces empirical distributions. Analytic solutions are derived for the proportions of cells in each cycle phase in a population growing under balanced growth and under specific non-stationary conditions. These solutions are then adapted to describe conventional cell cycle kinetic assays based on pulse labelling with nucleoside analogs. The model fits well to data obtained with two distinct proliferating cell lines labelled with a single bromodeoxyuridine pulse. However, whereas mean lengths are precisely estimated for all phases, the respective variances remain uncertain. To overcome this limitation, a redesigned experimental protocol is derived and validated in silico. The novelty is the timing of two consecutive pulses with distinct nucleosides, which enables accurate and precise estimation of both the mean and the variance of the length of all phases. The proposed methodology to quantify the phase length distributions gives results potentially equivalent to those obtained with modern phase-specific biosensor-based fluorescent imaging.
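The delayed exponential assumption stated in the abstract can be illustrated with a small simulation. The sketch below is not the authors' code: the phase names, delays and rates are hypothetical values chosen for illustration, and lumping G2 and M into a single phase is an assumption of this sketch. Each phase length is a fixed minimal time plus an exponentially distributed waiting time, so the total cycle length is a delayed hypoexponential variable whose mean is the sum of (delay + 1/rate) and whose variance is the sum of 1/rate^2 over the phases.

```python
import random
import statistics

# Hypothetical (delay in hours, rate in 1/hours) per phase.
# Illustrative values only; they are not taken from the article.
PHASES = {
    "G1": (4.0, 0.25),
    "S": (6.0, 0.5),
    "G2M": (2.0, 1.0),  # G2 and M lumped together (assumption of this sketch)
}


def sample_phase_length(delay, rate, rng):
    """Draw one phase length: a fixed minimal time plus an exponential wait."""
    return delay + rng.expovariate(rate)


def sample_cycle_length(phases, rng):
    """Sum independent phase lengths to obtain one total cell cycle length."""
    return sum(sample_phase_length(d, r, rng) for d, r in phases.values())


def theoretical_mean_var(phases):
    """Mean and variance of the delayed hypoexponential total cycle length."""
    mean = sum(d + 1.0 / r for d, r in phases.values())
    var = sum(1.0 / r ** 2 for _, r in phases.values())
    return mean, var


rng = random.Random(0)  # fixed seed so the run is reproducible
samples = [sample_cycle_length(PHASES, rng) for _ in range(100_000)]
mean_th, var_th = theoretical_mean_var(PHASES)

print("theory:    mean =", mean_th, "variance =", var_th)
print("simulated: mean =", statistics.fmean(samples),
      "variance =", statistics.variance(samples))
```

With these illustrative parameters the theoretical mean is 19 hours and the theoretical variance is 21 hours squared, and no simulated cycle can be shorter than the summed delays of 12 hours. The delays set the minimal time each phase requires, while the rates alone determine the variance.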

    Genome-wide Analyses Identify KIF5A as a Novel ALS Gene

    To identify novel genes associated with ALS, we undertook two lines of investigation. We carried out a genome-wide association study comparing 20,806 ALS cases and 59,804 controls. Independently, we performed a rare variant burden analysis comparing 1,138 index familial ALS cases and 19,494 controls. Through both approaches, we identified kinesin family member 5A (KIF5A) as a novel gene associated with ALS. Interestingly, mutations predominantly in the N-terminal motor domain of KIF5A are causative for two neurodegenerative diseases: hereditary spastic paraplegia (SPG10) and Charcot-Marie-Tooth type 2 (CMT2). In contrast, ALS-associated mutations are primarily located at the C-terminal cargo-binding tail domain and patients harboring loss-of-function mutations displayed an extended survival relative to typical ALS cases. Taken together, these results broaden the phenotype spectrum resulting from mutations in KIF5A and strengthen the role of cytoskeletal defects in the pathogenesis of ALS.

    26th Annual Computational Neuroscience Meeting (CNS*2017): Part 3 - Meeting Abstracts - Antwerp, Belgium. 15–20 July 2017

    This work was produced as part of the activities of FAPESP Research, Disseminations and Innovation Center for Neuromathematics (grant 2013/07699-0, S. Paulo Research Foundation). NLK is supported by a FAPESP postdoctoral fellowship (grant 2016/03855-5). ACR is partially supported by a CNPq fellowship (grant 306251/2014-0).

    Stability analysis.

    <p> as a function of for fixed values of For (green circle) the real part of Q takes, depending on a value in the interval The values for x are increasing from A-D, while and remain unchanged. For relatively low values of (A-B) the real part is positive for After one or several turns, i.e., by increasing the spiral can potentially cross the origin only once (empty circle). In A the spiral misses the origin, while in B the spiral crosses the origin after one turn. Crossing of the origin means that the corresponding complex number is a root of Q. In C the spiral starts at the origin. This represents the only real positive root of Q. For initially negative values of (D) the spiral can never cross the origin because the distance to the center point (gray circle) is already in the beginning for larger than the distance between the latter and the origin. By increasing y this distance will even grow further according to Eq. 33.</p>

    Analysis of simulated dual pulse labelling data.

    <p><b>A</b>: Average kinetics of unlabelled (dashed line) and labelled cell cohorts (colored lines) were computed from Eq. 25, using ML parameter estimates from the U87 and the V79 data sets (U87: V79: units are hours). Support points and repeats were chosen according to the real experiments. Multinomial noise was added, mimicking the residuals found in the original data sets (see the Computational Methods section for more details). Finally, model solutions (lines) were fitted to the synthetic data sets (triangles). Best fit parameters (U87: V79: units are hours). <b>B</b>: ML parameter estimates from simulated data. All ML regions converge to point estimates (arrows). Squares indicate parameters used for generating the data (see A). <b>C</b>: Bayesian bi-variate 99%-credibility regions for the parameters and for each phase, based on the artificial data.</p>
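The noise step mentioned in this caption can be sketched as follows. This is a minimal illustration and not the procedure from the Computational Methods section: the population names, the model fractions and the number of counted cells are all hypothetical placeholders. The idea is that, at one support point, the model-predicted fractions are replaced by the fractions observed among a finite number of randomly drawn cells.

```python
import random


def add_multinomial_noise(fractions, n_cells, rng):
    """Return the fractions observed among n_cells multinomial draws."""
    labels = list(fractions)
    weights = [fractions[name] for name in labels]
    # Each draw assigns one simulated cell to a population.
    draws = rng.choices(labels, weights=weights, k=n_cells)
    return {name: draws.count(name) / n_cells for name in labels}


# Hypothetical model fractions at one support point (placeholder names and values).
model_fractions = {"pop_1": 0.40, "pop_2": 0.30, "pop_3": 0.20, "pop_4": 0.10}

rng = random.Random(1)  # fixed seed so the run is reproducible
noisy_fractions = add_multinomial_noise(model_fractions, n_cells=10_000, rng=rng)
print(noisy_fractions)
```

Smaller values of `n_cells` give noisier synthetic fractions, which is one simple way to mimic residuals of different sizes.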

    DAPI-BrdU pulse-chase labelling FACS data.

    <p>Samples taken at several time points after pulse labelling proliferating U87 human glioblastoma cells with The four gated populations are and which are defined precisely in the main text. Briefly, the subscript indicates the phase at the instant of labelling, while the superscripts ‘u’, ‘lu’ and ‘ld’ refer to cells ‘unlabelled’, ‘labelled and undivided’ and ‘labelled and divided’, respectively. The data were generated as described in the Experimental Methods section.</p>

    Dual pulse protocol.

    <p><b>A</b>: Simplified schematic representations of the protocols corresponding to a conventional single pulse labelling with one nucleoside analog (e.g., BrdU) and a dual pulse labelling experiment with two different nucleoside analogs (e.g., BrdU together with IdU or EdU). <b>B</b>: Artificial staining of single-pulse labelling data (for original data see <a href="http://www.ploscompbiol.org/article/info:doi/10.1371/journal.pcbi.1003616#pcbi-1003616-g002" target="_blank">Fig. 2</a>), showing eight of the nine subpopulations that could potentially be identified with double-pulse labelling. Notice that the four populations and that can be followed by the conventional protocol have each been subdivided according to the cell cycle phases. The naming convention for the populations is as follows: the superscript ( = ‘labelled undivided’,  = ‘labelled divided’,  = ‘unlabelled’) indicates whether the population is labelled and whether it has divided since the time of the first pulse; the first and the second subscript () stand for the phase in which the population was at the time of the first and the second pulse respectively. Double subscripts are used only when necessary.</p>

    Model based parameter estimation.

    <p><b>A</b>: Best fit of the model predictions (lines) to experimentally determined cell fractions after BrdU pulse labelling (dots). U87: In vitro cultured U87 human glioblastoma cancer cell line (three replicates). V79: In vitro cultured V79 Chinese hamster cells (single replicate) (courtesy G. Wilson). Best fit parameter values used to compute model predictions (U87: V79: units are hours). <b>B</b>: Approximate ML regions for the parameters and associated with each phase (gray: red: green: ). <b>C</b>: Bayesian bi-variate 99%-credibility regions for the parameters and for each phase. Arrows indicate point estimates and the dashed lines delineate the information that could have been gained in our <i>thought</i> experiment under noise-free conditions from two support points, one at and a second at . The U87 data set was generated as described in the Experimental Methods section. The V79 data set was a kind gift of G. Wilson.</p>