Search CORE

237 research outputs found

Artificial intelligence (AI) in rare diseases: is the future brighter?

Author: Brasil Sandra
Ferreira Vanessa dos Reis
Francisco Rita
Pascoal Carlota
Valadão Matias Gonçalo
Videira P A
Publication venue: 'MDPI AG'
Publication date: 01/12/2019
Field of study

The amount of data collected and managed in (bio)medicine is ever-increasing. Thus, there is a need to rapidly and efficiently collect, analyze, and characterize all this information. Artificial intelligence (AI), with an emphasis on deep learning, holds great promise in this area and is already being successfully applied to basic research, diagnosis, drug discovery, and clinical trials. Rare diseases (RDs), which are severely underrepresented in basic and clinical research, can particularly benefit from AI technologies. Of the more than 7000 RDs described worldwide, only 5% have a treatment. The ability of AI technologies to integrate and analyze data from different sources (e.g., multi-omics, patient registries, and so on) can be used to overcome RDs' challenges (e.g., low diagnostic rates, reduced number of patients, geographical dispersion, and so on). Ultimately, RDs' AI-mediated knowledge could significantly boost therapy development. Presently, there are AI approaches being used in RDs and this review aims to collect and summarize these advances. A section dedicated to congenital disorders of glycosylation (CDG), a particular group of orphan RDs that can serve as a potential study model for other common diseases and RDs, has also been included.info:eu-repo/semantics/publishedVersio

Repositório Científico do Instituto Politécnico de Lisboa

Repositório da Universidade Nova de Lisboa

7th German Conference on Chemoinformatics: 25 CIC-Workshop : Goslar, Germany, 6 - 8 November 2011 ; meeting abstracts / Edited by Frank Oellien, Uli Fechner and Thomas Engel

Author: Engel Thomas
Fechner Uli
Oellien Frank
Publication venue
Publication date: 01/05/2012
Field of study

Hochschulschriftenserver - Universität Frankfurt am Main

Recommended from our members

Data exploration and knowledge extraction: their application to the study of endocrine disrupting chemicals

Author: Roncaglioni Alessandro
Publication venue
Publication date: 01/01/2008
Field of study

Interest in computer-aided methods for investigating the biological field has increased significantly. One method is Quantitative Structure-Activity Relationships (QSAR), a valuable technique for predicting the effects of a substance from its chemical structure. A challenging application of QSAR is in characterizing the (bio)activity profiles of chemicals. Endocrine disrupters (EDs) are exogenous substances interfering with the function of the endocrine system and represent an interesting field of application for in silico methods. EDs targets include nuclear receptors, particularly effects mediated by the oestrogen receptor (ER). They are also mentioned as substances requiring a more detailed control and specific authorisation within REACH, the new European legislation on chemicals. QSAR represents a challenging method to approach data gap about EDs since REACH substantially boosted interest on computational chemistry to replace experimental testing. This work: aimed to explore the status, availability and reliability of non-testing methods applied to endocrine disruption via oestrogen receptors and eventually to propose new models easily exploitable in regulatory contexts. The work evaluated existing QSAR models present in literature to assess their validity on the basis of the OECD principles for QSAR validation. Different kinds of models have been analysed and they were externally validated with new data found in the literature. Furthermore, new QSAR binary classifiers have been developed using different data mining techniques (e.g.: classification trees, fuzzy logic, neural networks) based on a very large and heterogeneous dataset of chemical compounds. The focus was given to both binding (RBA) and transcriptional activity (RA) better to characterize the effects of EDs. A possible combination of the models was also explored. A very good accuracy was achieved for both RA and RBA (>85%). These models can be a valuable complement to in vivo and in vitro studies in the toxicological characterisation of chemical compounds

Open Research Online (The Open University)

OpenGrey Repository

ANALYSIS AND SIMULATION OF TANDEM MASS SPECTROMETRY DATA

Author: Goldfarb Dennis
Publication venue: University of North Carolina at Chapel Hill Graduate School
Publication date: 01/01/2019
Field of study

This dissertation focuses on improvements to data analysis in mass spectrometry-based proteomics, which is the study of an organism’s full complement of proteins. One of the biggest surprises from the Human Genome Project was the relatively small number of genes (~20,000) encoded in our DNA. Since genes code for proteins, scientists expected more genes would be necessary to produce a diverse set of proteins to cover the many functions that support the complexity of life. Thus, there is intense interest in studying proteomics, including post-translational modifications (how proteins change after translation from their genes), and their interactions (e.g. proteins binding together to form complex molecular machines) to fill the void in molecular diversity. The goal of mass spectrometry in proteomics is to determine the abundance and amino acid sequence of every protein in a biological sample. A mass spectrometer can determine mass/charge ratios and abundance for fragments of short peptides (which are subsequences of a protein); sequencing algorithms determine which peptides are most likely to have generated the fragmentation patterns observed in the mass spectrum, and protein identity is inferred from the peptides. My work improves the computational tools for mass spectrometry by removing limitations on present algorithms, simulating mass spectroscopy instruments to facilitate algorithm development, and creating algorithms that approximate isotope distributions, deconvolve chimeric spectra, and predict protein-protein interactions. While most sequencing algorithms attempt to identify a single peptide per mass spectrum, multiple peptides are often fragmented together. Here, I present a method to deconvolve these chimeric mass spectra into their individual peptide components by examining the isotopic distributions of their fragments. First, I derived the equation to calculate the theoretical isotope distribution of a peptide fragment. Next, for cases where elemental compositions are not known, I developed methods to approximate the isotope distributions. Ultimately, I created a non-negative least squares model that deconvolved chimeric spectra and increased peptide-spectrum-matches by 15-30%. To improve the operation of mass spectrometer instruments, I developed software that simulates liquid chromatography-mass spectrometry data and the subsequent execution of custom data acquisition algorithms. The software provides an opportunity for researchers to test, refine, and evaluate novel algorithms prior to implementation on a mass spectrometer. Finally, I created a logistic regression classifier for predicting protein-protein interactions defined by affinity purification and mass spectrometry (APMS). The classifier increased the area under the receiver operating characteristic curve by 16% compared to previous methods. Furthermore, I created a web application to facilitate APMS data scoring within the scientific community.Doctor of Philosoph

Carolina Digital Repository

Analysing functional genomics data using novel ensemble, consensus and data fusion techniques

Author: Glaab Enrico
Publication venue
Publication date: 15/10/2011
Field of study

Motivation: A rapid technological development in the biosciences and in computer science in the last decade has enabled the analysis of high-dimensional biological datasets on standard desktop computers. However, in spite of these technical advances, common properties of the new high-throughput experimental data, like small sample sizes in relation to the number of features, high noise levels and outliers, also pose novel challenges. Ensemble and consensus machine learning techniques and data integration methods can alleviate these issues, but often provide overly complex models which lack generalization capability and interpretability. The goal of this thesis was therefore to develop new approaches to combine algorithms and large-scale biological datasets, including novel approaches to integrate analysis types from different domains (e.g. statistics, topological network analysis, machine learning and text mining), to exploit their synergies in a manner that provides compact and interpretable models for inferring new biological knowledge. Main results: The main contributions of the doctoral project are new ensemble, consensus and cross-domain bioinformatics algorithms, and new analysis pipelines combining these techniques within a general framework. This framework is designed to enable the integrative analysis of both large- scale gene and protein expression data (including the tools ArrayMining, Top-scoring pathway pairs and RNAnalyze) and general gene and protein sets (including the tools TopoGSA , EnrichNet and PathExpand), by combining algorithms for different statistical learning tasks (feature selection, classification and clustering) in a modular fashion. Ensemble and consensus analysis techniques employed within the modules are redesigned such that the compactness and interpretability of the resulting models is optimized in addition to the predictive accuracy and robustness. The framework was applied to real-word biomedical problems, with a focus on cancer biology, providing the following main results: (1) The identification of a novel tumour marker gene in collaboration with the Nottingham Queens Medical Centre, facilitating the distinction between two clinically important breast cancer subtypes (framework tool: ArrayMining) (2) The prediction of novel candidate disease genes for Alzheimer’s disease and pancreatic cancer using an integrative analysis of cellular pathway definitions and protein interaction data (framework tool: PathExpand, collaboration with the Spanish National Cancer Centre) (3) The prioritization of associations between disease-related processes and other cellular pathways using a new rule-based classification method integrating gene expression data and pathway definitions (framework tool: Top-scoring pathway pairs) (4) The discovery of topological similarities between differentially expressed genes in cancers and cellular pathway definitions mapped to a molecular interaction network (framework tool: TopoGSA, collaboration with the Spanish National Cancer Centre) In summary, the framework combines the synergies of multiple cross-domain analysis techniques within a single easy-to-use software and has provided new biological insights in a wide variety of practical settings

Nottingham eTheses

Analysing functional genomics data using novel ensemble, consensus and data fusion techniques

Author: Glaab Enrico
Publication venue
Publication date
Field of study

Nottingham ePrints

Bioinformatics

Author
Publication venue: 'IntechOpen'
Publication date: 20/04/2021
Field of study

This book is divided into different research areas relevant in Bioinformatics such as biological networks, next generation sequencing, high performance computing, molecular modeling, structural bioinformatics, molecular modeling and intelligent data analysis. Each book section introduces the basic concepts and then explains its application to problems of great relevance, so both novice and expert readers can benefit from the information and research works presented here

Directory of Open Access Books (DOAB)

Advances in Antimicrobial Peptide Discovery via Machine Learning and Delivery via Nanotechnology

Author: Li Bingyun
Sowers Alexa
Wang Guangshun
Xing Malcolm
Publication venue: DigitalCommons@UNMC
Publication date: 01/01/2023
Field of study

Antimicrobial peptides (AMPs) have been investigated for their potential use as an alternative to antibiotics due to the increased demand for new antimicrobial agents. AMPs, widely found in nature and obtained from microorganisms, have a broad range of antimicrobial protection, allowing them to be applied in the treatment of infections caused by various pathogenic microorganisms. Since these peptides are primarily cationic, they prefer anionic bacterial membranes due to electrostatic interactions. However, the applications of AMPs are currently limited owing to their hemolytic activity, poor bioavailability, degradation from proteolytic enzymes, and high-cost production. To overcome these limitations, nanotechnology has been used to improve AMP bioavailability, permeation across barriers, and/or protection against degradation. In addition, machine learning has been investigated due to its time-saving and cost-effective algorithms to predict AMPs. There are numerous databases available to train machine learning models. In this review, we focus on nanotechnology approaches for AMP delivery and advances in AMP design via machine learning. The AMP sources, classification, structures, antimicrobial mechanisms, their role in diseases, peptide engineering technologies, currently available databases, and machine learning techniques used to predict AMPs with minimal toxicity are discussed in detail

University of Nebraska Medical Center Research: DigitalCommons@UNMC

Computer aided drug design: Drug target directed in silico approaches

Author: CHEN XIN
Publication venue
Publication date: 13/01/2004
Field of study

Ph.DDOCTOR OF PHILOSOPH

ScholarBank@NUS

Molecular Science for Drug Development and Biomedicine

Author
Publication venue: MDPI - Multidisciplinary Digital Publishing Institute
Publication date: 22/10/2015
Field of study

With the avalanche of biological sequences generated in the postgenomic age, molecular science is facing an unprecedented challenge, i.e., how to timely utilize the huge amount of data to benefit human beings. Stimulated by such a challenge, a rapid development has taken place in molecular science, particularly in the areas associated with drug development and biomedicine, both experimental and theoretical. The current thematic issue was launched with the focus on the topic of “Molecular Science for Drug Development and Biomedicine”, in hopes to further stimulate more useful techniques and findings from various approaches of molecular science for drug development and biomedicine

Directory of Open Access Books (DOAB)