13 research outputs found

    Shaped 3D Singular Spectrum Analysis for Quantifying Gene Expression, with Application to the Early Zebrafish Embryo

    No full text
    Recent progress in microscopy technologies, biological markers, and automated processing methods is making possible the development of gene expression atlases at cellular-level resolution over whole embryos. Raw data on gene expression is usually very noisy. This noise comes from both experimental (technical/methodological) and true biological sources (from stochastic biochemical processes). In addition, the cells or nuclei being imaged are irregularly arranged in 3D space. This makes the processing, extraction, and study of expression signals and intrinsic biological noise a serious challenge for 3D data, requiring new computational approaches. Here, we present a new approach for studying gene expression in nuclei located in a thick layer around a spherical surface. The method includes depth equalization on the sphere, flattening, interpolation to a regular grid, pattern extraction by Shaped 3D singular spectrum analysis (SSA), and interpolation back to original nuclear positions. The approach is demonstrated on several examples of gene expression in the zebrafish egg (a model system in vertebrate development). The method is tested on several different data geometries (e.g., nuclear positions) and different forms of gene expression patterns. Fully 3D datasets for developmental gene expression are becoming increasingly available; we discuss the prospects of applying 3D-SSA to data processing and analysis in this growing field

    Assessing the Significance of Peptide Spectrum Match Scores

    No full text
    Peptidic Natural Products (PNPs) are highly sought after bioactive compounds that include many antibiotic, antiviral and antitumor agents, immunosuppressors and toxins. Even though recent advancements in mass-spectrometry have led to the development of accurate sequencing methods for nonlinear (cyclic and branch-cyclic) peptides, requiring only picograms of input material, the identification of PNPs via a database search of mass spectra remains problematic. This holds particularly true when trying to evaluate the statistical significance of Peptide Spectrum Matches (PSM) especially when working with non-linear peptides that often contain non-standard amino acids, modifications and have an overall complex structure. In this paper we describe a new way of estimating the statistical significance of a PSM, defined by any peptide (including linear and non-linear), by using state-of-the-art Markov Chain Monte Carlo methods. In addition to the estimate itself our method also provides an uncertainty estimate in the form of confidence bounds, as well as an automatic simulation stopping rule that ensures that the sample size is sufficient to achieve the desired level of result accuracy

    Shaped Singular Spectrum Analysis for Quantifying Gene Expression, with Application to the Early Drosophila Embryo

    No full text
    In recent years, with the development of automated microscopy technologies, the volume and complexity of image data on gene expression have increased tremendously. The only way to analyze quantitatively and comprehensively such biological data is by developing and applying new sophisticated mathematical approaches. Here, we present extensions of 2D singular spectrum analysis (2D-SSA) for application to 2D and 3D datasets of embryo images. These extensions, circular and shaped 2D-SSA, are applied to gene expression in the nuclear layer just under the surface of the Drosophila (fruit fly) embryo. We consider the commonly used cylindrical projection of the ellipsoidal Drosophila embryo. We demonstrate how circular and shaped versions of 2D-SSA help to decompose expression data into identifiable components (such as trend and noise), as well as separating signals from different genes. Detection and improvement of under- and overcorrection in multichannel imaging is addressed, as well as the extraction and analysis of 3D features in 3D gene expression patterns

    Quantification reveals early dynamics in Drosophila maternal gradients

    No full text
    The Bicoid (Bcd) protein is a primary determinant of early anterior-posterior (AP) axis specification in Drosophila embryogenesis. This morphogen is spatially distributed in an anterior-high gradient, and affects particular AP cell fates in a concentration-dependent manner. The early distribution and dynamics of the bicoid (bcd) mRNA, the source for the Bcd protein gradient, is not well understood, leaving a number of open questions for how Bcd positional information develops and is regulated. Confocal microscope images of whole early embryos, stained for bcd mRNA or the Staufen (Stau) protein involved in its transport, were processed to extract quantitative AP intensity profiles at two depths (apical-under the embryo surface but above the nuclear layer; and basal-below the nuclei). Each profile was quantified by a two- (or three-) exponential equation. The parameters of these equations were used to analyze the early developmental dynamics of bcd. Analysis of 1D profiles was compared with 2D intensity surfaces from the same images. This approach reveals strong early changes in bcd and Stau, which appear to be coordinated. We can unambiguously discriminate three stages in early development using the exponential parameters: pre-blastoderm (1-9 cleavage cycle, cc), syncytial blastoderm (10-13 cc) and cellularization (from 14A cc). Key features which differ in this period are how fast the first exponential (anterior component) of the apical profile drops with distance and whether it is higher or lower than the basal first exponential. We can further discriminate early and late embryos within the pre-blastoderm stage, depending on how quickly the anterior exponential drops. This relates to the posterior-wards spread of bcd in the first hour of development. Both bcd and Stau show several redistributions in the head cytoplasm, quite probably related to nuclear activity: first shifting inwards towards the core plasm, forming either protrusions (early pre-blastoderm) or round aggregations (early nuclear cleavage cycles, cc, 13 and 14), then moving to the embryo surface and spreading posteriorly. These movements are seen both with the 2D surface study and the 1D profile analysis. The continued spreading of bcd can be tracked from the time of nuclear layer formation (later pre-blastoderm) to the later syncytial blastoderm stages by the progressive loss of steepness of the apical anterior exponential (for both bcd and Stau). Finally, at the beginning of cc14 (cellularization stage) we see a distinctive flip from the basal anterior gradient being higher to the apical gradient being higher (for both bcd and Stau). Quantitative analysis reveals substantial (and correlated) bcd and Stau redistributions during early development, supporting that the distribution and dynamics of bcd mRNA are key factors in the formation and maintenance of the Bcd protein morphogenetic gradient. This analysis reveals the complex and dynamic nature of bcd redistribution, particularly in the head cytoplasm. These resemble observations in oogenesis; their role and significance have yet to be clarified. The observed co-localization during redistribution of bcd and Stau may indicate the involvement of active transport

    Simulated barcoded Rep-seq datasets (IGH, barcode length: 15 nt)

    No full text
    <p>Simulated barcoded Rep-seq libraries with various amplification error rates for repertoire from doi.org/10.5281/zenodo.823351. Barcode errors, barcode collisions and chimeric reads are introduced into datasets. Barcodes are encoded in headers.</p
    corecore