9 research outputs found

    Epigenome roadmap test data

    No full text
    H3K27me3. Belongs to Epigenome Roadma

    Hacat H3K4me3, H3K27me3 and Input bed files

    No full text
    Cite https://academic.oup.com/nar/article/41/5/2846/2414471 Aligned with: hisat2 --threads 24 -x /mnt/cargo/genomes/hisat2/hg38/genome --no-spliced-alignment -k 1 --no-discordant --no-mixed -U Satrom-chIP-05-Input_TGACCA_L002_R1_001.fast

    PyRanges: efficient comparison of genomic intervals in Python

    No full text
    Complex genomic analyses often use sequences of simple set operations like intersection, overlap and nearest on genomic intervals. These operations, coupled with some custom programming, allow a wide range of analyses to be performed. To this end, we have written PyRanges, a data structure for representing and manipulating genomic intervals and their associated data in Python. Run single threaded on binary set operations, PyRanges is in median 2.3–9.6 times faster than the popular R GenomicRanges library and is equally memory efficient; run multi-threaded on 8 cores, our library is up to 123 times faster. PyRanges is therefore ideally suited both for individual analyses and as a foundation for future genomic libraries in Python

    Joint changes in RNA, RNA polymerase II, and promoter activity through the cell cycle identify non-coding RNAs involved in proliferation

    No full text
    Proper regulation of the cell cycle is necessary for normal growth and development of all organisms. Conversely, altered cell cycle regulation often underlies proliferative diseases such as cancer. Long non-coding RNAs (lncRNAs) are recognized as important regulators of gene expression and are often found dysregulated in diseases, including cancers. However, identifying lncRNAs with cell cycle functions is challenging due to their often low and cell-type specific expression. We present a highly effective method that analyses changes in promoter activity, transcription, and RNA levels for identifying genes enriched for cell cycle functions. Specifically, by combining RNA sequencing with ChIP sequencing through the cell cycle of synchronized human keratinocytes, we identified 1009 genes with cell cycle-dependent expression and correlated changes in RNA polymerase II occupancy or promoter activity as measured by histone 3 lysine 4 trimethylation (H3K4me3). These genes were highly enriched for genes with known cell cycle functions and included 57 lncRNAs. We selected four of these lncRNAs—SNHG26, EMSLR, ZFAS1, and EPB41L4A-AS1—for further experimental validation and found that knockdown of each of the four lncRNAs affected cell cycle phase distributions and reduced proliferation in multiple cell lines. These results show that many genes with cell cycle functions have concomitant cell-cycle dependent changes in promoter activity, transcription, and RNA levels and support that our multi-omics method is well suited for identifying lncRNAs involved in the cell cycle

    Bioconda: Sustainable and comprehensive software distribution for the life sciences

    No full text

    Bioconda: A sustainable and comprehensive software distribution for the life sciences

    No full text
    We present Bioconda (https://bioconda.github.io), a distribution of bioinformatics software for the lightweight, multi-platform and language-agnostic package manager Conda. Currently, Bioconda offers a collection of over 3000 software packages, which is continuously maintained, updated, and extended by a growing global community of more than 200 contributors. Bioconda improves analysis reproducibility by allowing users to define isolated environments with defined software versions, all of which are easily installed and managed without administrative privileges
    corecore