136 research outputs found
Clustering of cognate proteins among distinct proteomes derived from multiple links to a single seed sequence
<p>Abstract</p> <p>Background</p> <p>Modern proteomes evolved by modification of pre-existing ones. It is extremely important to comparative biology that related proteins be identified as members of the same cognate group, since a characterized putative homolog could be used to find clues about the function of uncharacterized proteins from the same group. Typically, databases of related proteins focus on those from completely-sequenced genomes. Unfortunately, relatively few organisms have had their genomes fully sequenced; accordingly, many proteins are ignored by the currently available databases of cognate proteins, despite the high amount of important genes that are functionally described only for these incomplete proteomes.</p> <p>Results</p> <p>We have developed a method to cluster cognate proteins from multiple organisms beginning with only one sequence, through connectivity saturation with that Seed sequence. We show that the generated clusters are in agreement with some other approaches based on full genome comparison.</p> <p>Conclusion</p> <p>The method produced results that are as reliable as those produced by conventional clustering approaches. Generating clusters based only on individual proteins of interest is less time consuming than generating clusters for whole proteomes. </p
Transcriptional mechanisms underlying sensitization of peripheral sensory neurons by Granulocyte-/Granulocyte-macrophage colony stimulating factors
Background: Cancer-associated pain is a major cause of poor quality of life in cancer patients and is frequently resistant to conventional therapy. Recent studies indicate that some hematopoietic growth factors, namely granulocyte macrophage colony stimulating factor (GMCSF) and granulocyte colony stimulating factor (GCSF), are abundantly released in the tumor microenvironment and play a key role in regulating tumor-nerve interactions and tumor-associated pain by activating receptors on dorsal root ganglion (DRG) neurons. Moreover, these hematopoietic factors have been highly implicated in postsurgical pain, inflammatory pain and osteoarthritic pain. However, the molecular mechanisms via which G-/GMCSF bring about nociceptive sensitization and elicit pain are not known. Results: In order to elucidate G-/GMCSF mediated transcriptional changes in the sensory neurons, we performed a comprehensive, genome-wide analysis of changes in the transcriptome of DRG neurons brought about by exposure to GMCSF or GCSF. We present complete information on regulated genes and validated profiling analyses and report novel regulatory networks and interaction maps revealed by detailed bioinformatics analyses. Amongst these, we validate calpain 2, matrix metalloproteinase 9 (MMP9) and a RhoGTPase Rac1 as well as Tumor necrosis factor alpha (TNFĪ±) as transcriptional targets of G-/GMCSF and demonstrate the importance of MMP9 and Rac1 in GMCSF-induced nociceptor sensitization. Conclusion: With integrative approach of bioinformatics, in vivo pharmacology and behavioral analyses, our results not only indicate that transcriptional control by G-/GMCSF signaling regulates a variety of established pain modulators, but also uncover a large number of novel targets, paving the way for translational analyses in the context of pain disorders
Arena3D: visualization of biological networks in 3D
<p>Abstract</p> <p>Background</p> <p>Complexity is a key problem when visualizing biological networks; as the number of entities increases, most graphical views become incomprehensible. Our goal is to enable many thousands of entities to be visualized meaningfully and with high performance.</p> <p>Results</p> <p>We present a new visualization tool, Arena3D, which introduces a new concept of staggered layers in 3D space. Related data ā such as proteins, chemicals, or pathways ā can be grouped onto separate layers and arranged via layout algorithms, such as Fruchterman-Reingold, distance geometry, and a novel hierarchical layout. Data on a layer can be clustered via k-means, affinity propagation, Markov clustering, neighbor joining, tree clustering, or UPGMA ('unweighted pair-group method with arithmetic mean'). A simple input format defines the name and URL for each node, and defines connections or similarity scores between pairs of nodes. The use of Arena3D is illustrated with datasets related to Huntington's disease.</p> <p>Conclusion</p> <p>Arena3D is a user friendly visualization tool that is able to visualize biological or any other network in 3D space. It is free for academic use and runs on any platform. It can be downloaded or lunched directly from <url>http://arena3d.org</url>. Java3D library and Java 1.5 need to be pre-installed for the software to run.</p
FAIR data management: what does it mean for drug discovery?
The drug discovery community faces high costs in bringing safe and effective medicines to market, in part due to the rising volume and complexity of data which must be generated during the research and development process. Fully utilising these expensively created experimental and computational data resources has become a key aim of scientists due to the clear imperative to leverage the power of artificial intelligence (AI) and machine learning-based analyses to solve the complex problems inherent in drug discovery. In turn, AI methods heavily rely on the quantity, quality, consistency, and scope of underlying training data. While pre-existing preclinical and clinical data cannot fully replace the need for de novo data generation in a project, having access to relevant historical data represents a valuable asset, as its reuse can reduce the need to perform similar experiments, therefore avoiding a āreinventing the wheelā scenario. Unfortunately, most suitable data resources are often archived within institutes, companies, or individual research groups and hence unavailable to the wider community. Hence, enabling the data to be Findable, Accessible, Interoperable, and Reusable (FAIR) is crucial for the wider community of drug discovery and development scientists to learn from the work performed and utilise the findings to enhance comprehension of their own research outcomes. In this mini-review, we elucidate the utility of FAIR data management across the drug discovery pipeline and assess the impact such FAIR data has made on the drug development process
Martini: using literature keywords to compare gene sets
Life scientists are often interested to compare two gene sets to gain insight into differences between two distinct, but related, phenotypes or conditions. Several tools have been developed for comparing gene sets, most of which find Gene Ontology (GO) terms that are significantly over-represented in one gene set. However, such tools often return GO terms that are too generic or too few to be informative. Here, we present Martini, an easy-to-use tool for comparing gene sets. Martini is based, not on GO, but on keywords extracted from Medline abstracts; Martini also supports a much wider range of species than comparable tools. To evaluate Martini we created a benchmark based on the human cell cycle, and we tested several comparable tools (CoPub, FatiGO, Marmite and ProfCom). Martini had the best benchmark performance, delivering a more detailed and accurate description of function. Martini also gave best or equal performance with three other datasets (related to Arabidopsis, melanoma and ovarian cancer), suggesting that Martini represents an advance in the automated comparison of gene sets. In agreement with previous studies, our results further suggest that literature-derived keywords are a richer source of gene-function information than GO annotations. Martini is freely available at http://martini.embl.de
SMASCH: Facilitating multi-appointment scheduling in longitudinal clinical research studies and care programs
Longitudinal clinical research studies require conducting various assessments over long periods of time. Such assessments comprise numerous stages, requiring different resources defined by multidisciplinary research staff and aligned with available infrastructure and equipment, altogether constrained by time. While it is possible to manage the allocation of resources manually, it is complex and error-prone. Efficient multi-appointment scheduling is essential to assist clinical teams, ensuring high participant retention and producing successful clinical studies, directly impacting patient throughput and satisfaction. We present Smart Scheduling (SMASCH) system [1], a web application for multi-appointment scheduling management aiming to reduce times, optimise resources and secure personal identifiable information. SMASCH facilitates clinical research and integrated care programs in Luxembourg, providing features to better manage multi-appointment scheduling problems (MASPs) characteristic of longitudinal clinical research studies and speed up management tasks. It is present in multiple clinical research and integrated care programs in Luxembourg since 2017, including Dementia Prevention Program, the study for Mild Cognitive Impairment and gut microbiome, and the National Centre of Excellence in Research on Parkinsonās disease [2] which encompasses the study for REM sleep behaviour disorder and the Luxembourg Parkinsonās Study. SMASCH is a free and open-source solution available both as a Linux package and Docker image
The SYSCID map: a graphical and computational resource of molecular mechanisms across rheumatoid arthritis, systemic lupus erythematosus and inflammatory bowel disease
Chronic inflammatory diseases (CIDs), including inflammatory bowel disease (IBD), rheumatoid arthritis (RA) and systemic lupus erythematosus (SLE) are thought to emerge from an impaired complex network of inter- and intracellular biochemical interactions among several proteins and small chemical compounds under strong influence of genetic and environmental factors. CIDs are characterised by shared and disease-specific processes, which is reflected by partially overlapping genetic risk maps and pathogenic cells (e.g., T cells). Their pathogenesis involves a plethora of intracellular pathways. The translation of the research findings on CIDs molecular mechanisms into effective treatments is challenging and may explain the low remission rates despite modern targeted therapies. Modelling CID-related causal interactions as networks allows us to tackle the complexity at a systems level and improve our understanding of the interplay of key pathways. Here we report the construction, description, and initial applications of the SYSCID map (https://syscid.elixir-luxembourg.org/), a mechanistic causal interaction network covering the molecular crosstalk between IBD, RA and SLE. We demonstrate that the map serves as an interactive, graphical review of IBD, RA and SLE molecular mechanisms, and helps to understand the complexity of omics data. Examples of such application are illustrated using transcriptome data from time-series gene expression profiles following anti-TNF treatment and data from genome-wide associations studies that enable us to suggest potential effects to altered pathways and propose possible mechanistic biomarkers of treatment response
SmartR: An open-source platform for interactive visual analytics for translational research data.
In translational research, efficient knowledge exchange between the different fields of expertise is crucial. An open platform that is capable of storing a multitude of data types such as clinical, pre-clinical, or OMICS data combined with strong visual analytical capabilities will significantly accelerate the scientific progress by making data more accessible and hypothesis generation easier. The open data warehouse tranSMART is capable of storing a variety of data types and has a growing user community including both academic institutions and pharmaceutical companies. tranSMART, however, currently lacks interactive and dynamic visual analytics and does not permit any post-processing interaction or exploration. For this reason, we developed SmartR , a plugin for tranSMART, that equips the platform not only with several dynamic visual analytical workflows, but also provides its own framework for the addition of new custom workflows. Modern web technologies such as D3.js or AngularJS were used to build a set of standard visualizations that were heavily improved with dynamic elements. Contact: [email protected]. Supplementary information: Supplementary data are available at Bioinformatics online. Availability: : The source code is licensed under the Apache 2.0 License and is freely available on GitHub: https://github.com/transmart/SmartR
- ā¦