242 research outputs found

    PoPoolation: A Toolbox for Population Genetic Analysis of Next Generation Sequencing Data from Pooled Individuals

    Get PDF
    Recent statistical analyses suggest that sequencing of pooled samples provides a cost effective approach to determine genome-wide population genetic parameters. Here we introduce PoPoolation, a toolbox specifically designed for the population genetic analysis of sequence data from pooled individuals. PoPoolation calculates estimates of θWatterson, θπ, and Tajima's D that account for the bias introduced by pooling and sequencing errors, as well as divergence between species. Results of genome-wide analyses can be graphically displayed in a sliding window plot. PoPoolation is written in Perl and R and it builds on commonly used data formats. Its source code can be downloaded from http://code.google.com/p/popoolation/. Furthermore, we evaluate the influence of mapping algorithms, sequencing errors, and read coverage on the accuracy of population genetic parameter estimates from pooled data

    Search for time-dependent B0s - B0s-bar oscillations using a vertex charge dipole technique

    Get PDF
    We report a search for B0s - B0s-bar oscillations using a sample of 400,000 hadronic Z0 decays collected by the SLD experiment. The analysis takes advantage of the electron beam polarization as well as information from the hemisphere opposite that of the reconstructed B decay to tag the B production flavor. The excellent resolution provided by the pixel CCD vertex detector is exploited to cleanly reconstruct both B and cascade D decay vertices, and tag the B decay flavor from the charge difference between them. We exclude the following values of the B0s - B0s-bar oscillation frequency: Delta m_s < 4.9 ps-1 and 7.9 < Delta m_s < 10.3 ps-1 at the 95% confidence level.Comment: 18 pages, 3 figures, replaced by version accepted for publication in Phys.Rev.D; results differ slightly from first versio

    Why is the Winner the Best?

    Get PDF
    International benchmarking competitions have become fundamental for the comparative performance assessment of image analysis methods. However, little attention has been given to investigating what can be learnt from these competitions. Do they really generate scientific progress? What are common and successful participation strategies? What makes a solution superior to a competing method? To address this gap in the literature, we performed a multicenter study with all 80 competitions that were conducted in the scope of IEEE ISBI 2021 and MICCAI 2021. Statistical analyses performed based on comprehensive descriptions of the submitted algorithms linked to their rank as well as the underlying participation strategies revealed common characteristics of winning solutions. These typically include the use of multi-task learning (63%) and/or multi-stage pipelines (61%), and a focus on augmentation (100%), image preprocessing (97%), data curation (79%), and post-processing (66%). The “typical” lead of a winning team is a computer scientist with a doctoral degree, five years of experience in biomedical image analysis, and four years of experience in deep learning. Two core general development strategies stood out for highly-ranked teams: the reflection of the metrics in the method design and the focus on analyzing and handling failure cases. According to the organizers, 43% of the winning algorithms exceeded the state of the art but only 11% completely solved the respective domain problem. The insights of our study could help researchers (1) improve algorithm development strategies when approaching new problems, and (2) focus on open research questions revealed by this work

    Author Correction: Federated learning enables big data for rare cancer boundary detection.

    Get PDF

    Lack of Galanin 3 Receptor Aggravates Murine Autoimmune Arthritis

    Get PDF
    Neurogenic inflammation mediated by peptidergic sensory nerves has a crucial impact on the pathogenesis of various joint diseases. Galanin is a regulatory sensory neuropeptide, which has been shown to attenuate neurogenic inflammation, modulate neutrophil activation, and be involved in the development of adjuvant arthritis, but our current understanding about its targets and physiological importance is incomplete. Among the receptors of galanin (GAL1-3), GAL3 has been found to be the most abundantly expressed in the vasculature and on the surface of some immune cells. However, since there are minimal in vivo data on the role of GAL3 in joint diseases, we analyzed its involvement in different inflammatory mechanisms of the K/BxN serum transfer-model of autoimmune arthritis employing GAL 3 gene-deficient mice. After arthritis induction, GAL3 knockouts demonstrated increased clinical disease severity and earlier hindlimb edema than wild types. Vascular hyperpermeability determined by in vivo fluorescence imaging was also elevated compared to the wild-type controls. However, neutrophil accumulation detected by in vivo luminescence imaging or arthritic mechanical hyperalgesia was not altered by the lack of the GAL3 receptor. Our findings suggest that GAL3 has anti-inflammatory properties in joints by inhibiting vascular hyperpermeability and consequent edema formation

    Why is the winner the best?

    Get PDF
    International benchmarking competitions have become fundamental for the comparative performance assessment of image analysis methods. However, little attention has been given to investigating what can be learnt from these competitions. Do they really generate scientific progress? What are common and successful participation strategies? What makes a solution superior to a competing method? To address this gap in the literature, we performed a multicenter study with all 80 competitions that were conducted in the scope of IEEE ISBI 2021 and MICCAI 2021. Statistical analyses performed based on comprehensive descriptions of the submitted algorithms linked to their rank as well as the underlying participation strategies revealed common characteristics of winning solutions. These typically include the use of multi-task learning (63%) and/or multi-stage pipelines (61%), and a focus on augmentation (100%), image preprocessing (97%), data curation (79%), and post-processing (66%). The 'typical' lead of a winning team is a computer scientist with a doctoral degree, five years of experience in biomedical image analysis, and four years of experience in deep learning. Two core general development strategies stood out for highly-ranked teams: the reflection of the metrics in the method design and the focus on analyzing and handling failure cases. According to the organizers, 43% of the winning algorithms exceeded the state of the art but only 11% completely solved the respective domain problem. The insights of our study could help researchers (1) improve algorithm development strategies when approaching new problems, and (2) focus on open research questions revealed by this work

    SNP calling by sequencing pooled samples

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Performing high throughput sequencing on samples pooled from different individuals is a strategy to characterize genetic variability at a small fraction of the cost required for individual sequencing. In certain circumstances some variability estimators have even lower variance than those obtained with individual sequencing. SNP calling and estimating the frequency of the minor allele from pooled samples, though, is a subtle exercise for at least three reasons. First, sequencing errors may have a much larger relevance than in individual SNP calling: while their impact in individual sequencing can be reduced by setting a restriction on a minimum number of reads per allele, this would have a strong and undesired effect in pools because it is unlikely that alleles at low frequency in the pool will be read many times. Second, the prior allele frequency for heterozygous sites in individuals is usually 0.5 (assuming one is not analyzing sequences coming from, <it>e.g.</it> cancer tissues), but this is not true in pools: in fact, under the standard neutral model, singletons (<it>i.e.</it> alleles of minimum frequency) are the most common class of variants because <it>P</it>(<it>f</it>) ∝ 1/<it>f </it>and they occur more often as the sample size increases. Third, an allele appearing only once in the reads from a pool does not necessarily correspond to a singleton in the set of individuals making up the pool, and vice versa, there can be more than one read – or, more likely, none – from a true singleton.</p> <p>Results</p> <p>To improve upon existing theory and software packages, we have developed a Bayesian approach for minor allele frequency (MAF) computation and SNP calling in pools (and implemented it in a program called <monospace>snape</monospace>): the approach takes into account sequencing errors and allows users to choose different priors. We also set up a pipeline which can simulate the coalescence process giving rise to the SNPs, the pooling procedure and the sequencing. We used it to compare the performance of <monospace>snape</monospace> to that of other packages.</p> <p>Conclusions</p> <p>We present a software which helps in calling SNPs in pooled samples: it has good power while retaining a low false discovery rate (FDR). The method also provides the posterior probability that a SNP is segregating and the full posterior distribution of <it>f</it> for every SNP. In order to test the behaviour of our software, we generated (through simulated coalescence) artificial genomes and computed the effect of a pooled sequencing protocol, followed by SNP calling. In this setting, <monospace>snape</monospace> has better power and False Discovery Rate (FDR) than the comparable packages <monospace>samtools</monospace>, <monospace>PoPoolation</monospace>, <monospace>Varscan</monospace> : for <it>N </it>= 50 chromosomes, <monospace>snape</monospace> has power ≈ 35<it>%</it>and FDR ≈ 2.5<it>%</it>. <monospace>snape</monospace> is available at <url>http://code.google.com/p/snape-pooled/</url> (source code and precompiled binaries).</p

    Population Genomics on the Fly: Recent Advances in Drosophila

    Get PDF
    Drosophila melanogaster, a small dipteran of African origin, represents one of the best-studied model organisms. Early work in this system has uniquely shed light on the basic principles of genetics and resulted in a versatile collection of genetic tools that allow to uncover mechanistic links between genotype and phenotype. Moreover, given its worldwide distribution in diverse habitats and its moderate genome-size, Drosophila has proven very powerful for population genetics inference and was one of the first eukaryotes whose genome was fully sequenced. In this book chapter, we provide a brief historical overview of research in Drosophila and then focus on recent advances during the genomic era. After describing different types and sources of genomic data, we discuss mechanisms of neutral evolution including the demographic history of Drosophila and the effects of recombination and biased gene conversion. Then, we review recent advances in detecting genome-wide signals of selection, such as soft and hard selective sweeps. We further provide a brief introduction to background selection, selection of noncoding DNA and codon usage and focus on the role of structural variants, such as transposable elements and chromosomal inversions, during the adaptive process. Finally, we discuss how genomic data helps to dissect neutral and adaptive evolutionary mechanisms that shape genetic and phenotypic variation in natural populations along environmental gradients. In summary, this book chapter serves as a starting point to Drosophila population genomics and provides an introduction to the system and an overview to data sources, important population genetic concepts and recent advances in the field
    corecore