    Randomized QuickSort and the Entropy of the Random Source

    The worst-case complexity of an implementation of Quicksort depends on the random number generator that is used to select the pivot elements. In this paper we estimate the expected number of comparisons of Quicksort as a function of the entropy of the random source. We give upper and lower bounds and show that the expected number of comparisons increases from $n \log n$ to $n^2$ if the entropy of the random source is bounded. As examples we show explicit bounds for distributions with bounded min-entropy and the geometric distribution.
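The dependence on the random source can be illustrated with a small experiment. The sketch below is not the paper's analysis: it only contrasts the two extremes on an already sorted input, a seeded pseudo-random pivot choice (standing in for a high-entropy source) and a fixed pivot position (a zero-entropy source). The function name `quicksort_comparisons` and the callback `choose_pivot_index` are hypothetical names introduced for this illustration.

```python
import random


def quicksort_comparisons(items, choose_pivot_index):
    """Sort a copy of `items` with Quicksort and count element comparisons.

    `choose_pivot_index(lo, hi)` is a hypothetical callback that returns the
    pivot position in the inclusive range [lo, hi]; it models the random source.
    Returns (sorted_list, comparison_count).
    """
    data = list(items)
    comparisons = 0
    # Explicit stack instead of recursion, so the quadratic case
    # does not hit Python's recursion limit.
    stack = [(0, len(data) - 1)]
    while stack:
        lo, hi = stack.pop()
        if lo >= hi:
            continue
        # Move the chosen pivot to the end, then do a Lomuto partition.
        p = choose_pivot_index(lo, hi)
        data[p], data[hi] = data[hi], data[p]
        pivot = data[hi]
        store = lo
        for i in range(lo, hi):
            comparisons += 1
            if data[i] < pivot:
                data[i], data[store] = data[store], data[i]
                store += 1
        data[store], data[hi] = data[hi], data[store]
        stack.append((lo, store - 1))
        stack.append((store + 1, hi))
    return data, comparisons


if __name__ == "__main__":
    n = 200
    data = list(range(n))  # sorted input, the bad case for a fixed pivot

    # Zero-entropy source: always pick the first position.
    _, fixed_count = quicksort_comparisons(data, lambda lo, hi: lo)

    # Seeded pseudo-random source: uniform pivot position.
    rng = random.Random(0)
    _, random_count = quicksort_comparisons(
        data, lambda lo, hi: rng.randint(lo, hi)
    )

    print("fixed pivot: ", fixed_count)
    print("random pivot:", random_count)
```

With the fixed pivot every partition step removes only one element, so the count is exactly n(n-1)/2 on sorted input, while the uniformly chosen pivot stays close to the n log n regime. Sources with intermediate entropy, which are the subject of the abstract, are not modelled here.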

    EpiGEN: an epistasis simulation pipeline

    Summary: Simulated data are crucial for evaluating epistasis detection tools in genome-wide association studies. Existing simulators are limited, as they do not account for linkage disequilibrium (LD), support limited interaction models of single nucleotide polymorphisms (SNPs) and only dichotomous phenotypes or depend on proprietary software. In contrast, EpiGEN supports SNP interactions of arbitrary order, produces realistic LD patterns and generates both categorical and quantitative phenotypes. Availability and implementation: EpiGEN is implemented in Python 3 and is freely available at https://github.com/baumbachlab/epigen. Supplementary information: Supplementary data are available at Bioinformatics online.

    De novo pathway-based biomarker identification

    Gene expression profiles have been extensively discussed as an aid to guide therapy by predicting disease outcome for patients suffering from complex diseases, such as cancer. However, prediction models built upon single-gene (SG) features show poor stability and performance on independent datasets. Attempts to mitigate these drawbacks have led to the development of network-based approaches that integrate pathway information to produce meta-gene (MG) features. Also, MG approaches have only dealt with the two-class problem of good versus poor outcome prediction. Stratifying patients based on their molecular subtypes can provide a detailed view of the disease and lead to more personalized therapies. We propose and discuss a novel MG approach based on de novo pathways, which for the first time have been used as features in a multi-class setting to predict cancer subtypes. Comprehensive evaluation in a large cohort of breast cancer samples from The Cancer Genome Atlas (TCGA) revealed that MGs are considerably more stable than SG models, while also providing valuable insight into the cancer hallmarks that drive them. In addition, when tested on an independent benchmark non-TCGA dataset, MG features consistently outperformed SG models. We provide an easy-to-use web service at http://pathclass.compbio.sdu.dk where users can upload their own gene expression datasets from breast cancer studies and obtain the subtype predictions from all the classifiers.

    Comprehensive analysis of high-throughput screens with HiTSeekR

    High-throughput screening (HTS) is an indispensable tool for drug (target) discovery that currently lacks user-friendly software tools for the robust identification of putative hits from HTS experiments and for the interpretation of these findings in the context of systems biology. We developed HiTSeekR as a one-stop solution for chemical compound screens, siRNA knock-down and CRISPR/Cas9 knock-out screens, as well as microRNA inhibitor and mimic screens. We chose three use cases that demonstrate the potential of HiTSeekR to fully exploit HTS data in quite heterogeneous contexts to generate novel hypotheses for follow-up experiments: (i) a genome-wide RNAi screen to uncover modulators of TNFα, (ii) a combined siRNA and miRNA mimics screen on vorinostat resistance and (iii) a small compound screen on KRAS synthetic lethality. HiTSeekR is publicly available at http://hitseekr.compbio.sdu.dk. It is the first approach to close the gap between raw data processing, network enrichment and wet lab target generation for various types of HTS.

    Efficient Sample Tracking With OpenLabFramework

    The advance of new technologies in biomedical research has led to a dramatic growth in experimental throughput. Projects therefore steadily grow in size and involve a larger number of researchers. The spreadsheets traditionally used are thus no longer suitable for keeping track of the vast numbers of samples created and need to be replaced with state-of-the-art laboratory information management systems. Such systems have been developed in large numbers, but they are often limited to specific research domains and types of data. One domain so far neglected is the management of libraries of vector clones and genetically engineered cell lines. OpenLabFramework is a newly developed web application for sample tracking, designed specifically to fill this gap, but with an open architecture that allows it to be extended to other biological materials and functional data. Its sample tracking mechanism is fully customizable and aids productivity further through support for mobile devices and barcoded labels.

    Hepatocellular Carcinoma and Nuclear Paraspeckles: Induction in Chemoresistance and Prediction for Poor Survival

    Background/Aims: Hepatocellular carcinoma (HCC) represents the second most common cause of cancer-related deaths worldwide, not least due to its high chemoresistance. The long non-coding RNA nuclear paraspeckle assembly transcript 1 (NEAT1), localised in nuclear paraspeckles, has been shown to enhance chemoresistance in several cancer types. Since data on NEAT1 in HCC chemosensitivity are completely lacking and chemoresistance is linked to poor prognosis, we aimed to study NEAT1 expression in HCC chemoresistance and its link to HCC prognosis. Methods: NEAT1 expression was determined by qPCR in sensitive, sorafenib-resistant, and doxorubicin-resistant HepG2, PLC/PRF/5, and Huh7 cells. Paraspeckles were detected by immunostaining of paraspeckle component 1 (PSPC1) in cell culture and in a cohort of HCC patients. PSPC1 expression was correlated with clinical data. The expression of transcript variants of NEAT1 and transcripts encoding the paraspeckle-associated proteins was analysed in the TCGA liver cancer data set. Results: NEAT1 was overexpressed in all three sorafenib- and doxorubicin-resistant cell lines. Paraspeckles were present in all chemoresistant cells, whereas no signal was detected in the sensitive cells. Expression of NEAT1 transcripts as well as transcripts encoding PSPC1, NONO, and RBM14 was increased in tumour tissue. Expression of PSPC1, NONO, and RBM14 transcripts was significantly associated with poor survival, whereas NEAT1 expression was not. Immunohistochemical analysis revealed that nuclear and cytoplasmic PSPC1-positivity was significantly associated with shorter overall survival of HCC patients. Conclusion: Our data show an induction of NEAT1 in HCC chemoresistance and a high correlation of transcripts encoding paraspeckle-associated proteins with poor survival in HCC. Therefore, NEAT1, PSPC1, NONO, and RBM14 might be promising targets for novel HCC therapies, and the paraspeckle-associated proteins might be clinical markers and predictors for poor survival in HCC.