143 research outputs found

    Sampling constrained probability distributions using Spherical Augmentation

    Full text link
    Statistical models with constrained probability distributions are abundant in machine learning. Some examples include regression models with norm constraints (e.g., Lasso), probit, many copula models, and latent Dirichlet allocation (LDA). Bayesian inference involving probability distributions confined to constrained domains could be quite challenging for commonly used sampling algorithms. In this paper, we propose a novel augmentation technique that handles a wide range of constraints by mapping the constrained domain to a sphere in the augmented space. By moving freely on the surface of this sphere, sampling algorithms handle constraints implicitly and generate proposals that remain within boundaries when mapped back to the original space. Our proposed method, called {Spherical Augmentation}, provides a mathematically natural and computationally efficient framework for sampling from constrained probability distributions. We show the advantages of our method over state-of-the-art sampling algorithms, such as exact Hamiltonian Monte Carlo, using several examples including truncated Gaussian distributions, Bayesian Lasso, Bayesian bridge regression, reconstruction of quantized stationary Gaussian process, and LDA for topic modeling.Comment: 41 pages, 13 figure

    Stochastic Blockmodeling for the Analysis of Big Data

    Get PDF
    The aim of this paper is to consider the stochastic blockmodel to obtain clusters of units as regards patterns of similar relations; moreover we want to analyze the relations between clusters. Blockmodeling is a technique usually applied in social network analysis focusing on the relations between \u201cactors\u201d i.e. units. In our time people and devices constantly generate data. The network is generating location and other data that keeps services running and ready to use in every moment. This rapid development in the availability and access to data has induced the need for better analysis techniques to understand the various phenomena. Blockmodeling techniques and Clustering algorithms, can be used for this aim. In this paper application regards the Web

    Composite likelihood estimation of demographic parameters

    Get PDF
    which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Background: Most existing likelihood-based methods for fitting historical demographic models to DNA sequence polymorphism data to do not scale feasibly up to the level of whole-genome data sets. Computational economies can be achieved by incorporating two forms of pseudo-likelihood: composite and approximate likelihood methods. Composite likelihood enables scaling up to large data sets because it takes the product of marginal likelihoods as an estimator of the likelihood of the complete data set. This approach is especially useful when a large number of genomic regions constitutes the data set. Additionally, approximate likelihood methods can reduce the dimensionality of the data by summarizing the information in the original data by either a sufficient statistic, or a set of statistics. Both composite and approximate likelihood methods hold promise for analyzing large data sets or for use in situations where the underlying demographic model is complex and has many parameters. This paper considers a simple demographic model of allopatric divergence between two populations, in which one of the population is hypothesized to have experienced a founder event, or population bottleneck. A large resequencing data set from human populations is summarized by the joint frequency spectrum, which is a matrix of the genomic frequency spectrum of derived base frequencies in two populations. A Bayesia

    Transformation of the rodent malaria parasite Plasmodium chabaudi and generation of a stable fluorescent line PcGFPCON

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The rodent malaria parasite <it>Plasmodium chabaudi </it>has proven of great value in the analysis of fundamental aspects of host-parasite-vector interactions implicated in disease pathology and parasite evolutionary ecology. However, the lack of gene modification technologies for this model has precluded more direct functional studies.</p> <p>Methods</p> <p>The development of <it>in vitro </it>culture methods to yield <it>P. chabaudi </it>schizonts for transfection and conditions for genetic modification of this rodent malaria model are reported.</p> <p>Results</p> <p>Independent <it>P. chabaudi </it>gene-integrant lines that constitutively express high levels of green fluorescent protein throughout their life cycle have been generated.</p> <p>Conclusion</p> <p>Genetic modification of <it>P. chabaudi </it>is now possible. The production of genetically distinct reference lines offers substantial advances to our understanding of malaria parasite biology, especially interactions with the immune system during chronic infection.</p

    Loss of SMARCA4 (BRG1) protein expression as determined by immunohistochemistry in small-cell carcinoma of the ovary, hypercalcaemic type distinguishes these tumours from their mimics

    Get PDF
    AIMS: Molecular investigation of small-cell carcinoma of the ovary, hypercalcaemic type (SCCOHT) has revealed that it is a monogenetic tumour characterized by alteration of SMARCA4 (BRG1), encoding a member of the switch/sucrose non-fermentable (SWI/SNF) chromatin remodelling complex. A large majority of cases show loss of expression of the corresponding SMARCA4/BRG1 protein. Furthermore, three cases of SCCOHT with retained SMARCA4 protein expression showed loss of SMARCB1/INI1 expression. The aim of this study was to assess the sensitivity and specificity of loss of SMARCA4 expression as a diagnostic test for SCCOHT. METHODS AND RESULTS: We performed SMARCA4 and SMARCB1 staining in 245 tumours, many of which were potentially in the differential diagnosis of SCCOHT. We also stained 56 cases of SCCOHT for SMARCA4 and 37 of these for SMARCB1. Fifty-four of the SCCOHT cases showed complete absence of SMARCA4 expression. The two cases with retained expression showed molecular alteration of SMARCA4. Of the 217 other neoplasms with interpretable staining, all retained SMARCA4 expression. Although the majority showed diffuse, strong nuclear expression, a heterogeneous, typically weak staining pattern was present in 13% of cases. All 37 cases of SCCOHT tested and all other neoplasms, apart from three malignant rhabdoid tumours, showed retained nuclear SMARCB1 expression. Loss of SMARCA4 expression had a sensitivity of 96.55% and specificity of 100%. CONCLUSIONS: Loss of SMARCA4 expression is sensitive and specific for SCCOHT. Although some mimics show heterogeneous expression, there is retention of nuclear staining in at least a part of the tumour; therefore, only complete loss of staining should be regarded as being supportive of SCCOHT

    Consolidating emerging evidence surrounding HIVST and HIVSS: A rapid systematic mapping protocol

    Get PDF
    BACKGROUND: HIV self-testing (HIVST) is becoming popular with policy makers and commissioners globally, with a key aim of expanding access through reducing barriers to testing for individuals at risk of HIV infection. HIV self-sampling (HIVSS) was available previously to self-testing but was confined mainly to the USA and the UK. It remains to be seen whether the momentum behind HIVST will also energise efforts to expand HIVSS. Recent years have seen a rapid growth in the type of evidence related to these interventions as well as several systematic reviews. The vast majority of this evidence relates to acceptability as well as values and preferences, although new types of evidence are emerging. This systematic map aims to consolidate all emerging evidence related to HIVST and HIVSS to respond to this rapidly changing area. METHODS: We will systematically search databases and the abstracts of five conferences from 2006 to the present date, with monthly-automated database searches. Searches will combine key terms relating to HIV (e.g. HIV, AIDS, human immune-deficiency syndrome) with terms related to self-testing (e.g. home-test, self-test, mail-test, home dried blood spot test). Abstracts will be reviewed against inclusion criteria in duplicate. Data will be manually extracted through a standard form and then entered to an open access relational map (HIVST.org). When new and sufficient evidence emerges which addresses existing knowledge gaps, we will complete a review on a relevant topic. DISCUSSION: This innovative approach will allow rapid cataloguing, documenting and dissemination of new evidence and key findings as they emerge into the public domain. SYSTEMATIC REVIEW REGISTRATION: This protocol has not been registered with PROSPERO as they do not register systematic maps

    BABAR: an R package to simplify the normalisation of common reference design microarray-based transcriptomic datasets

    Get PDF
    Background: The development of DNA microarrays has facilitated the generation of hundreds of thousands of transcriptomic datasets. The use of a common reference microarray design allows existing transcriptomic data to be readily compared and re-analysed in the light of new data, and the combination of this design with large datasets is ideal for 'systems' level analyses. One issue is that these datasets are typically collected over many years and may be heterogeneous in nature, containing different microarray file formats and gene array layouts, dye-swaps, and showing varying scales of log(2)- ratios of expression between microarrays. Excellent software exists for the normalisation and analysis of microarray data but many data have yet to be analysed as existing methods struggle with heterogeneous datasets; options include normalising microarrays on an individual or experimental group basis. Our solution was to develop the Batch Anti-Banana Algorithm in R (BABAR) algorithm and software package which uses cyclic loess to normalise across the complete dataset. We have already used BABAR to analyse the function of Salmonella genes involved in the process of infection of mammalian cells. Results: The only input required by BABAR is unprocessed GenePix or BlueFuse microarray data files. BABAR provides a combination of 'within' and 'between' microarray normalisation steps and diagnostic boxplots. When applied to a real heterogeneous dataset, BABAR normalised the dataset to produce a comparable scaling between the microarrays, with the microarray data in excellent agreement with RT-PCR analysis. When applied to a real non-heterogeneous dataset and a simulated dataset, BABAR's performance in identifying differentially expressed genes showed some benefits over standard techniques. Conclusions: BABAR is an easy-to-use software tool, simplifying the simultaneous normalisation of heterogeneous two-colour common reference design cDNA microarray-based transcriptomic datasets. We show BABAR transforms real and simulated datasets to allow for the correct interpretation of these data, and is the ideal tool to facilitate the identification of differentially expressed genes or network inference analysis from transcriptomic datasets

    Candidate biomarkers of PARP inhibitor sensitivity in ovarian cancer beyond the BRCA genes

    Get PDF
    BACKGROUND: Olaparib (Lynparza™) is a PARP inhibitor approved for advanced BRCA-mutated (BRCAm) ovarian cancer. PARP inhibitors may benefit patients whose tumours are dysfunctional in DNA repair mechanisms unrelated to BRCA1/2. We report exploratory analyses, including the long-term outcome of candidate biomarkers of sensitivity to olaparib in BRCA wild-type (BRCAwt) tumours. METHODS: Tumour samples from an olaparib maintenance monotherapy trial (Study 19, D0810C00019; NCT00753545) were analysed. Analyses included classification of mutations in genes involved in homologous recombination repair (HRR), BRCA1 promoter methylation status, measurement of BRCA1 protein and Myriad HRD score. RESULTS: Patients with BRCAm tumours gained most benefit from olaparib; a similar treatment benefit was also observed in 21/95 patients whose tumours were BRCAwt but had loss-of-function HRR mutations compared to patients with no detectable HRR mutations (58/95). A higher median Myriad MyChoice® HRD score was observed in BRCAm and BRCAwt tumours with BRCA1 methylation. Patients without BRCAm tumours derived benefit from olaparib treatment vs placebo although to a lesser extent than BRCAm patients.CONCLUSIONS: Ovarian cancer patients with tumours harbouring loss-of-function mutations in HRR genes other than BRCA1/2 may constitute a small, molecularly identifiable and clinically relevant population who derive treatment benefit from olaparib similar to patients with BRCAm

    Ten Years of Surveillance for Invasive Streptococcus pneumoniae during the Era of Antiretroviral Scale-Up and Cotrimoxazole Prophylaxis in Malawi

    Get PDF
    OBJECTIVE: To document trends in invasive pneumococcal disease (IPD) in a central hospital in Malawi during the period of national scale-up of antiretroviral therapy (ART) and cotrimoxazole prophylaxis. METHODS: Between 1 January 2000 and 31 December 2009 almost 100,000 blood cultures and 40,000 cerebrospinal fluid (CSF) cultures were obtained from adults and children admitted to the Queen Elizabeth Central Hospital, Blantyre, Malawi with suspected severe bacterial infection. RESULTS: 4,445 pneumococcal isolates were obtained over the 10 year period. 1,837 were from children: 885 (19.9%) from blood and 952 (21.4%) from CSF. 2,608 were from adults: 1,813 (40.8%) from blood and 795 (17.9%) from CSF. At the start of the surveillance period cotrimoxazole resistance was 73.8% and at the end was 92.6%. Multidrug resistance (MDR) was present in almost one third of isolates and was constant over time. Free ART was introduced in Malawi in 2004. From 2005 onwards there was a decline in invasive pneumococcal infections with a negative correlation between ART scale-up and the decline in IPD (Pearson's correlation r = -0.91; p<0.001). CONCLUSION: During 2004-2009, national ART scale-up in Malawi was associated with a downward trend in IPD at QECH. The introduction of cotrimoxazole prophylaxis in HIV-infected groups has not coincided with a further increase in pneumococcal cotrimoxazole or multidrug resistance. These data highlight the importance of surveillance for high disease burden infections such as IPD in the region, which will be vital for monitoring pneumococcal conjugate vaccine introduction into national immunisation programmes
    corecore