946 research outputs found
PETAL: a python tool for deep analysis of biological pathways.
Abstract
Summary
Although several bioinformatics tools have been developed to examine signaling pathways, little attention has been given to ever long-distance crosstalk mechanisms. Here, we developed PETAL, a Python tool that automatically explores and detects the most relevant nodes within a KEGG pathway, scanning and performing an in-depth search. PETAL can contribute to discovering novel therapeutic targets or biomarkers that are potentially hidden and not considered in the network under study.
Availabilityand implementation
PETAL is a freely available open-source software. It runs on all platforms that support Python3. The user manual and source code are accessible from https://github.com/Pex2892/PETAL
Distributed Computing in a Pandemic: A Review of Technologies Available for Tackling COVID-19
The current COVID-19 global pandemic caused by the SARS-CoV-2 betacoronavirus
has resulted in over a million deaths and is having a grave socio-economic
impact, hence there is an urgency to find solutions to key research challenges.
Much of this COVID-19 research depends on distributed computing. In this
article, I review distributed architectures -- various types of clusters, grids
and clouds -- that can be leveraged to perform these tasks at scale, at
high-throughput, with a high degree of parallelism, and which can also be used
to work collaboratively. High-performance computing (HPC) clusters will be used
to carry out much of this work. Several bigdata processing tasks used in
reducing the spread of SARS-CoV-2 require high-throughput approaches, and a
variety of tools, which Hadoop and Spark offer, even using commodity hardware.
Extremely large-scale COVID-19 research has also utilised some of the world's
fastest supercomputers, such as IBM's SUMMIT -- for ensemble docking
high-throughput screening against SARS-CoV-2 targets for drug-repurposing, and
high-throughput gene analysis -- and Sentinel, an XPE-Cray based system used to
explore natural products. Grid computing has facilitated the formation of the
world's first Exascale grid computer. This has accelerated COVID-19 research in
molecular dynamics simulations of SARS-CoV-2 spike protein interactions through
massively-parallel computation and was performed with over 1 million volunteer
computing devices using the Folding@home platform. Grids and clouds both can
also be used for international collaboration by enabling access to important
datasets and providing services that allow researchers to focus on research
rather than on time-consuming data-management tasks.Comment: 21 pages (15 excl. refs), 2 figures, 3 table
Recommended from our members
novoBreak: local assembly for breakpoint detection in cancer genomes.
We present novoBreak, a genome-wide local assembly algorithm that discovers somatic and germline structural variation breakpoints in whole-genome sequencing data. novoBreak consistently outperformed existing algorithms on real cancer genome data and on synthetic tumors in the ICGC-TCGA DREAM 8.5 Somatic Mutation Calling Challenge primarily because it more effectively utilized reads spanning breakpoints. novoBreak also demonstrated great sensitivity in identifying short insertions and deletions
A big data approach for sequences indexing on the cloud via burrows wheeler transform
Indexing sequence data is important in the context of Precision Medicine, where large amounts of "omics"data have to be daily collected and analyzed in order to categorize patients and identify the most effective therapies. Here we propose an algorithm for the computation of Burrows Wheeler transform relying on Big Data technologies, i.e., Apache Spark and Hadoop. Our approach is the first that distributes the index computation and not only the input dataset, allowing to fully benefit of the available cloud resources. Copyright © 2020 for this paper by its authors
RNAseq Analyses Identify Tumor Necrosis Factor-Mediated Inflammation as a Major Abnormality in ALS Spinal Cord
ALS is a rapidly progressive, devastating neurodegenerative illness of adults that produces disabling weakness and spasticity arising from death of lower and upper motor neurons. No meaningful therapies exist to slow ALS progression, and molecular insights into pathogenesis and progression are sorely needed. In that context, we used high-depth, next generation RNA sequencing (RNAseq, Illumina) to define gene network abnormalities in RNA samples depleted of rRNA and isolated from cervical spinal cord sections of 7 ALS and 8 CTL samples. We aligned \u3e50 million 2X150 bp paired-end sequences/sample to the hg19 human genome and applied three different algorithms (Cuffdiff2, DEseq2, EdgeR) for identification of differentially expressed genes (DEGâs). Ingenuity Pathways Analysis (IPA) and Weighted Gene Co-expression Network Analysis (WGCNA) identified inflammatory processes as significantly elevated in our ALS samples, with tumor necrosis factor (TNF) found to be a major pathway regulator (IPA) and TNFα-induced protein 2 (TNFAIP2) as a major network âhubâ gene (WGCNA). Using the oPOSSUM algorithm, we analyzed transcription factors (TF) controlling expression of the nine DEG/hub genes in the ALS samples and identified TFâs involved in inflammation (NFkB, REL, NFkB1) and macrophage function (NR1H2::RXRA heterodimer). Transient expression in human iPSC-derived motor neurons of TNFAIP2 (also a DEG identified by all three algorithms) reduced cell viability and induced caspase 3/7 activation. Using high-density RNAseq, multiple algorithms for DEG identification, and an unsupervised gene co-expression network approach, we identified significant elevation of inflammatory processes in ALS spinal cord with TNF as a major regulatory molecule. Overexpression of the DEG TNFAIP2 in human motor neurons, the population most vulnerable to die in ALS, increased cell death and caspase 3/7 activation. We propose that therapies targeted to reduce inflammatory TNFα signaling may be helpful in ALS patients
Mutational signatures in esophageal adenocarcinoma define etiologically distinct subgroups with therapeutic relevance
Esophageal adenocarcinoma (EAC) has a poor outcome, and targeted therapy trials have thus far been disappointing owing to a lack of robust stratification methods. Whole-genome sequencing (WGS) analysis of 129 cases demonstrated that this is a heterogeneous cancer dominated by copy number alterations with frequent large-scale rearrangements. Co-amplification of receptor tyrosine kinases (RTKs) and/or downstream mitogenic activation is almost ubiquitous; thus tailored combination RTK inhibitor (RTKi) therapy might be required, as we demonstrate in vitro. However, mutational signatures showed three distinct molecular subtypes with potential therapeutic relevance, which we verified in an independent cohort (n = 87): (i) enrichment for BRCA signature with prevalent defects in the homologous recombination pathway; (ii) dominant T>G mutational pattern associated with a high mutational load and neoantigen burden; and (iii) C>A/T mutational pattern with evidence of an aging imprint. These subtypes could be ascertained using a clinically applicable sequencing strategy (low coverage) as a basis for therapy selection
A scale-out RDF molecule store for distributed processing of biomedical data
The computational analysis of protein-protein interaction and biomolecular pathway data paves the way to efficient in silico drug discovery and therapeutic target identification. However, relevant data sources are currently distributed across a wide range of disparate, large-scale, publicly-available databases and repositories and are described using a wide range of taxonomies and ontologies. Sophisticated integration, manipulation, processing and analysis of these datasets are required in order to reveal previously undiscovered interactions and pathways that will lead to the discovery of new drugs. The BioMANTA project focuses on utilizing Semantic Web technologies together with a scale-out architecture to tackle the above challenges and to provide efficient analysis, querying, and reasoning about protein-protein interaction data. This paper describes the initial results of the BioMANTA project. The fully-developed system will allow knowledge representation and processing that are not currently available in typical scale-out or Semantic Web databases. We present the design of the architecture, basic ontology and some implementation details that aim to provide efficient, scalable RDF storage and inferencing. The results of initial performance evaluation are also provided
Mutational signatures in esophageal adenocarcinoma define etiologically distinct subgroups with therapeutic relevance.
Esophageal adenocarcinoma (EAC) has a poor outcome, and targeted therapy trials have thus far been disappointing owing to a lack of robust stratification methods. Whole-genome sequencing (WGS) analysis of 129 cases demonstrated that this is a heterogeneous cancer dominated by copy number alterations with frequent large-scale rearrangements. Co-amplification of receptor tyrosine kinases (RTKs) and/or downstream mitogenic activation is almost ubiquitous; thus tailored combination RTK inhibitor (RTKi) therapy might be required, as we demonstrate in vitro. However, mutational signatures showed three distinct molecular subtypes with potential therapeutic relevance, which we verified in an independent cohort (n = 87): (i) enrichment for BRCA signature with prevalent defects in the homologous recombination pathway; (ii) dominant T>G mutational pattern associated with a high mutational load and neoantigen burden; and (iii) C>A/T mutational pattern with evidence of an aging imprint. These subtypes could be ascertained using a clinically applicable sequencing strategy (low coverage) as a basis for therapy selection.Whole-genome sequencing of esophageal adenocarcinoma samples was performed as part of the International Cancer Genome Consortium (ICGC) through the oEsophageal Cancer Clinical and Molecular Stratification (OCCAMS) Consortium and was funded by Cancer Research UK. We thank the ICGC members for their input on verification standards as part of the benchmarking exercise. We thank the Human Research Tissue Bank, which is supported by the National Institute for Health Research (NIHR) Cambridge Biomedical Research Centre, from Addenbrookeâs Hospital and UCL. Also the University Hospital of Southampton Trust and the Southampton, Birmingham, Edinburgh and UCL Experimental Cancer Medicine Centres and the QEHB charities. This study was partly funded by a project grant from Cancer Research UK. R.C.F. is funded by an NIHR Professorship and receives core funding from the Medical Research Council and infrastructure support from the Biomedical Research Centre and the Experimental Cancer Medicine Centre. We acknowledge the support of The University of Cambridge, Cancer Research UK (C14303/A17197) and Hutchison Whampoa Limited. We would like to thank Dr. Peter Van Loo for providing the NGS version of ASCAT for copy number calling. We are grateful to all the patients who provided written consent for participation in this study and the staff at all participating centres.
Some of the work was undertaken at UCLH/UCL who received a proportion of funding from the Department of Healthâs NIHR Biomedical Research Centres funding scheme. The work at UCLH/UCL was also supported by the CRUK UCL Early Cancer Medicine Centre.This is the author accepted manuscript. The final version is available from Nature Publishing Group via http://dx.doi.org/10.1038/ng.365
- âŠ